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Abstract: This work develops change-point methods for statistics of high-frequency data. The 
main interest is in the volatility of an Ito semi-martingale, the latter being discretely observed over 
a fixed time horizon. We construct a minimax-optimal test to discriminate continuous paths from 
paths comprising volatility jumps. This is embedded into a more general theory to infer the smooth¬ 
ness of volatilities. In a high-frequency framework we prove weak convergence of the test statistic 
under the hypothesis to an extreme value distribution. Moreover, we develop methods to infer 
changes in the Hurst parameter of fractional volatility processes. A simulation study demonstrates 
the practical value in finite-sample applications. 
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1. Introduction 

Change-point theory classically focuses on detecting one or several structural breaks in the trend of 
time series. Statistical methods to infer change-points have a long and rich history, dating back to the 
pioneering work of Page (1955). Prominent approaches as e.g. by Hinkley (1971), Pettitt (1980), An¬ 
drews (1993) or Bai and Perron (1998), among many others, provide statistical tests for the hypothesis 
of no change-point against the alternative that changes occur. Moreover, they allow for localization of 
change-points (estimation) and confidence intervals. Change-point methods usually rely on maximum 
statistics and exploit limit theorems from extreme value theory; see Csorgo and Horvath (1997) for 
an overview. Less focus has been laid on discriminating jumps from continuous motion in a nonpara¬ 
metric framework. Important exceptions are Muller (1992), Muller and Stadtmuller (1999), Spokoiny 

’Financial support from the Deutsche Forschungsgemeinschaft via SFB 649 Okonomisches Risiko and FOR 1735 Struc¬ 
tural Inference in Statistics: Adaptation and Efficiency is gratefully acknowledged. 

^We thank Marc Hoffmann for helpful remarks on testing hypotheses of smoothness classes. We also thank two anony¬ 
mous referees for valuable comments on an earlier version. 
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(1998) and Wu and Zhao (2007) in the framework of nonparametric regression analysis. The latter 
serves as an important point of orientation for this work. 

Statistics of high-frequency data is concerned with discretizations of continuous-time stochastic 
processes, most generally Ito semi-martingales. The continuous part of an Ito semi-martingale is of 
the form 


X t = X o + 


: ds + 


a, 


dW* 


( 1 ) 


defined on a filtered probability space (Q, F, (Ty), P) with a standard (J r f )-Brownian motion W and 
adapted drift and volatility processes a and a. One key topic is statistical inference on the volatility 
under high-frequency asymptotics when the mesh of a discretization on a fixed time horizon tends to 
zero. There is a vast body of works related to this problem and its economic implications; see e.g. An¬ 
dersen and Bollerslev (1998), Mykland and Zhang (2009) and Jacod and Rosenbaum (2013), among 
many others. Statistics for a discretized continuous-time martingale is closely related to Gaussian cal¬ 
culus as highlighted by Mykland (2012) what is also at the heart of our analysis. Many contributions 
evolve around the question if jumps are present in the Ito semi-martingale modeling the log-price of a 
financial asset; see Art-Sahalia and Jacod (2009) for a statistical test. 

A more involved problem which is of key interest for economics and finance is to infer the smooth¬ 
ness of the underlying stochastic volatility process and to check whether volatility jumps occur. In 
particular, inference on volatility jumps allows to investigate the impact of certain news arrivals on fi¬ 
nancial risk. A first empirical study by Tauchen and Todorov (2011) indicates that volatility jumps can 
occur but, due to the lack of statistical methods, has been based on direct observations of the VIX, the 
most prominent available volatility index. Further contributions consider joint price-volatility jumps. 
Jacod and Todorov (2010) have designed a test to decide from high-frequency observations if contem¬ 
poraneous jumps of an Ito semi-martingale and its volatility process have taken place at least once over 
some fixed time interval. These methods do not generalize to test directly for volatility jumps without 
restricting to a finite set of large price adjustments first. One main profit from our change-point analy¬ 
sis of high-frequency data is a general test for volatility jumps. Moreover, results on estimation of the 
time of a volatility jump are provided. 

As an example, we illustrate in Figure 1 the evolution of log-prices of two blue-chip stocks, 3M and 
GE, over the NASDAQ intra-day trading period (6.5 h rescaled to the unit interval) on March 18th, 
2009. We consider one minute returns from executed trades 1 to ensure the semi-martingale model is 
adequate and limit a manipulation by market microstructure frictions. Available tests and criteria do 
not identify price adjustments so large to be ascribed to jumps such that the test by Jacod and Todorov 
(2010) is not applicable. It seems as if a common source of news drives price dynamics at the end of 
that day concertedly. The picture becomes much clearer when focusing on the estimated spot squared 
volatilities in Figure 1, for which we average at each time point the previous 20 rescaled squared 
returns. This example suggests that volatility dynamics vary over time. Here, the volatilities of both 
assets sky-rocket at exactly the same time. This common volatility jump coincides in time with a press 
release at 02:15 p.m. EST subsequent to a meeting of the Federal Open Market Committee. The time 
is marked in Figure 1 by the dashed lines. In light of increasing economic slack, the FOMC announced 
“to employ all available tools to promote economic recovery and to preserve price stability” 2 , includ¬ 
ing a guarantee for an exceptionally low level of the federal funds rate for an extended period and a 
considerable increase of the size of the Federal Reserve’s balance sheet. The mathematical concepts 
developed in this work provide a novel device for assessing volatility dynamics and jumps. 

'reconstructed from the order book using LOBSTER, https : / /lobster . wiwi . hu-berlin . de/ 

2 source: www . federalreserve . gov/monetarypolicy/fomcminutes20090318 . htm 
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FIGURE 1. Log-price intra-day evolutions (top) and estimated spot squared volatilities (bottom) for MMM (left) and GE 
(right) on March 18th, 2009. 


Change-point methods for volatility in a time-series environment, which is quite different to our 
high-frequency semi-martingale setting, have been discussed by Spokoiny (2009). Quasi-likelihood 
estimation of a change-point in a diffusion parameter in a high-frequency setting has been considered 
by Iacus and Yoshida (2012), pointing out already one very useful bridge between change-point theory 
and high-frequency statistics. Our main focus is on testing for the presence of changes in a general 
setup exploiting localization techniques. Beyond the analysis of possible jumps of volatility there is 
great interest in the smoothness regularity of volatilities; see e.g. Gatheral et al. (2014) for a recent 
work, not least because of its crucial role for setting up volatility models. 

We focus on volatilities which are almost surely locally bounded and strictly positive adapted pro¬ 
cesses. For our testing problem we consider classes of squared volatilities 

E(a, L n ) = {(of )t e[0i i] | sup |of - a 2 s \ < L n 5 a ) , (2) 

s,iS[0,l],|s—1|<<5 ^ 

for an appropriate sequence L n converging to infinity; cf. Assumption 3.1 for a precise statement 
about the conditions under the null hypothesis. The regularity exponent a > 0 is the key parameter to 
describe the null hypothesis Hq. We may now more formally ask the following questions: 

(i) Is there a jump in the volatility, i.e. Acr| = (cr| — lim s ^ of) > 0 for some 9 E (0,1)? 

(ii) Does volatility get rougher in the sense of a regularity exponent a' < a on ( 9 ,1]? 

Question (i) poses a local problem, whereas question (ii) entails a local or global problem. In 
particular, one single jump is a discontinuity point which is not informative about the volatility’s 
smoothness elsewhere. More general than jumps, our theory to address the local problem includes 
abrupt yet continuous adjustments of the volatility over a short time period. This is an example of a 
local change of regularity, where a drops to 0 < a' < a for a short period of time before attaining its 
original value again. In this framework, the case of a jump corresponds to a' = 0. 
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On the other hand, at some time price fluctuations may considerably increase or decrease and this 
rougher behaviour persists permanently over the remaining time interval. Then, we witness a global 
change. As a key example, our methods devoted to the global problem allow to infer changes in the 
Hurst parameter in a fractional volatility model. Naturally, as presumably in Figure 1, both local and 
global events may occur simultaneously. This work presents methods to test local and global alterna¬ 
tives, both relying on different foundations. A desirable property for a global approach is robustness 
with respect to local changes which is crucial to distinguish the two problems. This means in particular 
that a test statistic to decide the question of global changes should not be affected by a fixed number 
of jumps, which may be interpreted as a part of the hypothesis of no global change. The other way 
round, an approach to test for local changes may very well be affected by a global change as well, for 
instance at the very time of change or by an exceptionally large fluctuation. 

We consider minimax-optimal testing and estimation in both situations covering broad classes of 
volatility processes. For a conceptual introduction to minimax-optimal tests, let us focus first on dis¬ 
criminating smooth volatilities in the sense of (2) from volatilities with at least one jump. From a 
statistical perspective, the key question is which sizes of volatility jumps can be detected. For ex¬ 
ample, it is clear that we cannot detect jumps of arbitrarily small size. Loosely speaking, if we say 
that ‘no jump’ is our null hypothesis Hq and ‘there is a jump’ is our alternative Hi, then we face the 
problem of distinguishability between // () and H \. The minimum size b n of a jump AoJ, such that we 
are still able to uniformly control the type I and type II errors, is called detection boundary. If we are 
interested to test for the presence of jumps, we are thus led to consider for 9 £ (0,1) alternatives of 
the form 


Sq (a, b n , L n ) = j(of) te[(U] |(of - Acr 2 ) te[01] € E(a,L n ); |Acr|| > b n j (3) 

with a decreasing sequence b n . We then address the testing problem 

#o : K 2 H) te[ o,i] G £(a,L„) vs. Hi : 3 9 e (0,1) with (of (w)) te[0il ] € S/(a, b n , L n ). (4) 

In this context 9 is commonly referred to as a change-point. The test alternative means that we demand 
at least one jump but do not exclude multiple jumps. The dependence on ui in (4) is natural in the 
definition of the hypotheses, as different realizations might lead to different paths on [0,1], 

For the testing problem (4), we establish the minimax-optimal rate of convergence under high- 
frequency asymptotics. We follow the notion of minimax-optimality of statistical tests from the sem¬ 
inal contributions of Ingster (1993). For tests ip that map a sample X„ to zero or one, where ip ac¬ 
cepts the null hypothesis Hq if ip = 0 and rejects if -0 = 1, we consider the maximal type I error 
Oty(a) = sup CT 2 eS ( o Z/ri ) P CT (^ = l) and the maximal type II error 

Pi’(a, bn) = sup ee(01 ) sup 0 . 2 e5 J (a)bn>Ln) P a (ip = 0) and define the global testing error as 


7i> (a b n ) — otyj (n) 4“ (ct, b n ). 


(5) 


The primary interest now lies on tests that minimize yy (a, b n ), given the boundary b n . We aim to find 
sequences of tests ip n and boundaries b n with the property that 

7y>n(Mn) -A o as n-s-oo. 

The smaller b n > 0, the harder it is for a test to control the global testing error, i.e. to distinguish 
between Hq and Hi. It is thus natural to pose the question given a, what is the minimal size of ^ 0 
such that 

lirn inf yy (a, b n ) = 0 

7T—>-00 \jj 


( 6 ) 
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holds? The optimal b"!’ 1 is called minimax distinguishable boundary, and a sequence of tests '(p n 
that satisfies (6) for all b n > !>T minimax-optimal. If L in (2) is constant, we prove that b n oc 


— a 


(n/ log(n)) 2o + 1 constitutes the minimax distinguishable boundary for testing (4) and our constructed 
test is eligible to attain minimax-optimality. If L n is indeed a sequence, the rate only slightly changes; 
see Section 4.1 for precise results. 

For the lower bound proof we simplify the problem by information-theoretic reductions passing to 
more informative sub-classes of the parameter space. The lower bound established for the sub-class 
then serves a fortiori as a lower bound in the more general and less informative model. After gradually 
transforming the problem by showing strong Le Cam equivalences of the considered sub-experiment 
to more common situations with i.i.d. chi-square and Gaussian variables, the lower bound is proved 
by classical arguments based on the theory in Ingster and Suslina (2003). 

The paper is organized as follows: Section 2 serves as an illustration for the benefit of cusum-based 
statistics in the simple, yet important model of a continuous Ito semimartingale with constant volatil¬ 
ity. This illuminates the connection of classical change-point methods and high-frequency statistics. 
More involved, but also more important in practice is the case where the volatility is both time-varying 
and random. Section 3 is devoted to this nonparametric local problem. As the volatility process is la¬ 
tent, which requires estimation based on smoothed squared increments of the semi-martingale, this 
poses an intricate statistical problem which to the best of the authors’ knowledge had not been ad¬ 
dressed so far. We establish a consistent test and derive a limit theorem under the hypothesis. The 
asymptotic analysis utilizes nonparametric change-point theory, stochastic calculus and bounds on 
the approximation error in the invariance principle. Our test allows to distinguish paths with jumps 
from continuous paths under remarkably general smoothness assumptions on the hypothesis. In Sec¬ 
tion 3.2 we discuss the situation in which the underlying Ito semimartingale might have jumps as 
well. Section 4.1 provides the theory on minimax-optimality with the lower bound, while Section 4.2 
deals with the estimation of the location of the change in volatility under the alternative. Finally, a 
minimax-optimal nonparametric test for the global problem is established in Section 5. In practice, it 
is attractive to interconnect both methods which complement each other. A simulation study that in¬ 
vestigates the finite-sample performance of the proposed methods and discusses some practical issues 
can be found in Section 6. All proofs are postponed to the Appendix. 

2. Change-points in a parametric volatility model 

Arguably, the simplest model of a continuous-time Ito diffusion X is the case of no drift and a constant 
volatility, such that X is given by 



(V) 


where W denotes a standard Brownian motion. Throughout this work, the underlying process X is 
recorded at discrete regular times iA n with a mesh A n —>• 0. To keep the notation uncluttered, we 
assume to be on the fixed time interval [0,1] and set n = A” 1 E IN, so that we have observations 

XiA n ,i = 0,... ,n. 

Inference on the squared volatility a 2 is usually based on increments A™X = X,;/y„ — Xu_i^ n . 
In case one is interested in changes in the volatility, a natural quantity to discuss is the cusum statistic 
which reads 



( 8 ) 
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FIGURE 2. Left: One realized path of X with structural break in volatility att = 1/2. Right: Empirical results for the test 
statistic from 10000 iterations under both alternative and hypothesis. 


In order to derive the asymptotics of the cusum statistic, recall the functional (stable) central limit 
theorem for the realized volatility from observations of a continuous Ito semi-martingale (1) by Jacod 
(1997). Under mild assumptions, we have 



(A?X) 2 - 


a s ds 


\J~2i 


a~ dB. 


,t€ [0,1] 


(9) 


as n —> oo weakly in the Skorokhod space with a standard Brownian motion B independent of W. In 
particular, if a s = a is constant, this result directly implies 


Sn,[nt\ f{B t - tBi) , (10) 

with 7 2 = limn^oo n Var( Ya=i i A ? x Y) = 2a 4 , which coincides with a standard cusum limit 
theorem in the vein of Phillips (1987). The quarticity estimator by Barndorff-Nielsen and Shephard 
(2002), 7 2 = (2n/3) i 4 , may be used to obtain a self-normalizing version: 

(? n v 1/2 

f ¥ E ( A ? X ) 4 ] S n,lnt] -+B t -tB 1 , (11) 


where the limit process is a standard Brownian bridge. Testing for jumps (resp. structural breaks, 
change-points) of the volatility is then pursued based on 


T n 


sup 


(f) 1/2 5 , 


( 12 ) 


as the test statistic which (under the null that the volatility is constant) tends as n —> oo to a 
Kolmogorov-Smirnov law; see Marsaglia et al. (2003). Under the alternative T n diverges almost 
surely. 

Figure 2 shows an example in which we observe n = 10000 values of a standard Brownian motion 
under the hypothesis, while under the alternative the volatility jumps in t = 1/2 from 1 to 1.1. Out 
of 10000 Monte Carlo iterations for hypothesis and alternative, only 21 realizations of (12) under the 
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hypothesis are larger than the mi nimum under the alternative. The other way round, in 11 iterations 
the values under the alternative fall below the maximum of the generated values from the hypothesis. 
The cusum approach hence clearly allows to separate hypothesis and alternative here, even for the 
relatively small volatility jump which is not readily identifiable from the path of X in Figure 2. 

This is illustrated within the histogram in Figure 2. The left part stems from realizations under the 
hypothesis which closely track the asymptotic Kolmogorov-Smirnov law. The right part instead is due 
to realizations under the alternative. For larger volatility jumps the right part moves further to the right 
such that the two distributions separate even more clearly. This test based on (12) of Kolmogorov- 
Smirnov type permits to test the hypothesis of a constant volatility against structural breaks in an 
efficient way. 

Beyond this bridging of classical change-point analysis and structural breaks in a parametric volatil¬ 
ity model, our main focus in the sequel is nonparametric: to distinguish volatility jumps from a con¬ 
tinuous motion of volatility or to identify changes in the regularity exponent. 

3. Nonparametric change-point test for the volatility against jump alternatives 
3.1. Construction and limit behavior under the hypothesis 

Suppose we observe a continuous Ito semi-martingale (1) at the regular times iA n , i = 0,..., n E IN. 
In this setting, we want to construct a test for (4). With the volatility process being time-varying, it 
becomes apparent from (9) that the test statistic (12) is not suitable to test Ho against H\. Our core 
idea is to utilize local two-sample f-tests over asymptotically small time blocks instead. As a first test 
statistic, we consider 


Fn — max \HVn,i / HVn,i-\-l 1|; 

i=0,...,[n/k n \— 2 

where k n —tools an auxiliary sequence of integers depending on n and 


RV «.‘ = F 

7 = 1 


ik n +j 


X) 2 ,i = 0,..., [n/k n \ - 1, 


(13) 


(14) 


is a rescaled local version of realized volatility over blocks of the partition [ik n A n , (i + l)k n A n \. 
The RV n> i estimate a block-wise constant proxy of the spot volatility <Tik n A n on the respective blocks. 
Asymptotic properties of RV n ,i were e.g. derived in Alvarez et al. (2012). A large distance between 
RV Uj i and R.V n ,i+i suggests the presence of a jump or unsmooth breaks in the volatility close to time 
ik n A n . In order to obtain normalized statistics we work with ratios instead of differences. Identifying 
breaks, in particular jumps, from large deviations between the ratios of two successive local volatility 
estimators and one, V n appears to be a reasonable test statistic for our problem. 

Our second test statistic is of the same nature as (13), but instead of non-overlapping blocks it takes 
into account all overlapping blocks of k n increments: 


V* = max 

i=k„,...,n-k n 


£EU + ,(A”i) ! 


et eS; 1 (a”x) : 


- 1 


jj=i+l 


(15) 


In comparison to nonparametric change-point approaches like the one by Wu and Zhao (2007), both 
statistics (13) and (15) are based on ratios rather than differences. This makes sense intuitively, since 
we are not dealing with the typical additive error structure of time series models. In our setting, 
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we have e.g. n{ A/26) 2 ~ <t 2 a , i = 1 ,n, with i.i.d. x^-distributed random variables so 
that the volatility er plays the role of a multiplicative error. Therefore, by computing ratios first, we 
basically deal with a maximum of identically distributed variables in the asymptotics. This is of key 
importance to obtain a distribution free limit under the hypothesis. 

In order to discuss the asymptotics of V n and V* under the null hypothesis we need a couple of 
additional assumptions, all of which are rather mild and are covered by a variety of stochastic volatility 
models. 

Assumption 3.1. The following assumptions on the processes a and a are in order: 

(1) a and a are locally bounded processes. 

(2) a is almost surely strictly positive, i.e. inf te [o,i] °t > cr 2 > 0. 

(3) On the hypothesis, Of c 0, the modulus of continuity 

ws(a)t = sup{|(j s - <7 r | : \s - r\ < 5} 

s,r<t 

is locally bounded in the sense that there exists a > 0 and a sequence of stopping times T n —> oo 
such that wg(a) (t„ai) < L n 5 a , for some a > 0 and some (a.s. finite) random variables L n . 

In particular this implies of E £(a, L n ) for some n almost surely on Of. To consider sequences 
L n becomes important when developing lower bounds, see the first paragraph of Section 4.1 for a 
detailed explanation. We choose the sequence k n —> oo, as n —> oo, such that the following growth 
condition holds: 


K 1 + s/K(k n A n ) a y/log(n) 0, (16) 

for some e > 0 and with a > 0 from Assumption 3.1 (3). 

There are two conditions contained in (16). First, k n —> oo faster than some power of n which is a 
mild lower bound on the growth of k n as n -» oo. This ensures consistency of the estimates (14). The 
second condition gives an upper bound related to the continuity of a. Naturally, the smaller a (and the 
less smooth a), the smaller we have to choose the size of the blocks over which we estimate a. 

Theorem 3.2. Set m n = \n/k n \ and y mn = [41og(m n ) — 2 log(log(m n ))] 1//2 . If Assumption 3.1 
holds and k n satisfies condition (16), then we have on i V (under Hq) 

y/\og(m n ) ( (kl/ 2 / v/2) V n - lmn ) ^ V,, (17) 

s/\og{m n ) (k l J 2 /V2) V* - 2 log (m n ) - ^ log log (m n ) - log (3) Ak, (18) 

where V follows an extreme value distribution with distribution function 

P(y < x) = exp(—7T -1 / 2 exp(— x)). (19) 

Remark 3.3. It is remarkable that Theorem 3.2 in combination with condition (16) allows asymp¬ 
totically to distinguish between volatility paths with jumps and volatility paths without jumps, where 
we only require some granted smoothness a > 0 in Assumption 3.1 (3). Note that less smooth paths 
require smaller block lengths k n by (16) which reduces the rate in Theorem 3.2 and the power of the 
test. Most importantly, we can cope with standard models for a. For a continuous semi-martingale 
volatility, we have a « 1/2. In this case, we take k n oc n 1 / 2 for e > 0 and e small to preserve the 
highest possible power. Similarly, for a Lipschitz volatility, i.e. a = 1, one might choose k n oc n 2 / 3-e . 
Thus, the choice of the block length is close to optimal window sizes for spot volatility estimation. □ 
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As we show in Theorem 4.2 that V* and V n diverge under the alternative almost surely, Theorem 
3.2 provides a consistent test with asymptotic power 1 by critical values from the limit law under the 
hypothesis. 


3.2. A test in presence of jumps in the observed process 


In order to provide a valid approach for various economic applications an important aim is to account 
for possible jumps in the process X as well. Thereto, consider a general Ito semi-martingale 


X t 


X 0 + a s ds+ (T s dW s + / / k(S(s,x))(p - is)(ds,dx) 

Jo Jo Jo Jr 

+ / / R(5(s,x))p(ds,dx ), 

Jo Jr 


( 20 ) 


where a truncation function n, Rjx) = x — k(x), separates large from compensated small jumps. The 
compensating intensity measure u of the Poisson random measure p admits the form u{ds,dx) = 
ds <S> A (dx) for a cr-finite measure A. Our notation follows Jacod (2008). 

Assumption 3.4. Grant Assumption 3.1 for the continuous part of X. Suppose sup^ x |5(s, x)\/y(x) 
is locally bounded for some deterministic non-negative function 7 which satisfies for some r < 2: 


/ (1 A y r (x))A(dx) < 00 . (21) 

Jr 

In condition (21), r is a jump activity index that bounds the path-wise generalized Blumenthal- 
Getoor index from above. Imposing r < 1 restricts to jumps of finite variation and r = 0 to finite jump 
activity. To develop test statistics which are robust against jumps we employ a truncation principle as 
introduced for integrated volatility estimation by Mancini (2009) and Jacod (2008). The analogue of 
(13) with truncated squared increments reads 


Vn,u n — n i La j) \TRVn,u n f/TRVn,u„,i+l l|j 
l —0 j • • • 5 |_T 2 j Kpi J 2 


TRV n 


n 


£(A 3 m,-*) ^-{\A? kn+j x\<u n } , * — 0 ,..., [n/k n \ 1 . 


3 = 1 


( 22 ) 

(23) 


The truncation sequence u n oc n T , r 6 (0,1/2), is used to exclude large squared increments which 
can be ascribed to jumps. In the same way we can generalize statistic (15) with overlapping blocks: 


V* = 

v n,u n 


max 
i=k n ,...,n—k n 


ifT!j=i-k n + i( a "20 2i {|a«xi<m 


tEai( A F) 2 i { | A Fi<M 


- 1 


(24) 


We prove below that truncation is an appropriate concept to asymptotically eliminate the influence by 
jumps, at least under certain restrictions on the jump activity, on k n and on r. In particular - , under the 
hypothesis we obtain the same limit behaviour of the test statistics as in Theorem 3.2. 

Proposition 3.5. Suppose k n oc n/ 1 for 0 < fi < 1, such that condition (16) is satisfied. Furthermore, 
grant Assumption 3.4 for some 


r< min ^2(2 — r 1 (1 —/3/2)),r 1 min(l/2,1 — /3)^j 


(25) 
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as well. Then, with m n = \n/k n \ and 7 mn = [4log(m n ) — 2 log(log(m n ))] 1//2 as before, and if 
either r = 0 or the jump process is a time-inhomogeneous Levy process, we have on I V ( under Hq) 
for the statistics (22) and (24) the weak convergences 

v/log(m n ) (( kl/ 2 /V2) V n , Un - 7m „)A V, (26) 
\/l°g (m n ) {k}J 2 /V2) V* }Un -2 log (m n ) - ^ log log (m n ) - log (3) -A V, ill) 

where V is distributed according to (19). 

Remark 3.6. A simple computation shows that a necessary condition in order for (25) to hold is r < 1, 
but obviously it involves further conditions on the interplay between r, /3 and r. This restriction on 
the jump activity is stronger than the usual r < 1 for truncated realized volatility as in Jacod (2008), 
and it is due to the maximum in (24) compared to linear estimators. 

Remark 3.7. Arguably, it is most relevant from an applied perspective that the test based on (24) 
copes with finite activity jumps. In this case (25) reads as r > 1/2 — /3/4, and the only requirement 
is that u n is not chosen too large. Beyond the finite activity case, the choice of the tuning parameters 
becomes more complex and depends on the statistician’s interest. Ideally, one would choose (3 large to 
secure a high power, but it should not become too large as (25) then is more restrictive and interesting 
models on the jumps might be ruled out. As an equilibrium choice /3 « 1/2 is recommended in which 
case t ~ 1/2 is optimal, leading to the condition r < 1. Choosing t close to 1/2 and u n small 
improves the precision of localized truncated realized volatilities (23). As an exact choice for u n one 
typically picks it sufficiently large to just not interfere with the continuous component of Xf. Based 
on extreme value theory for Gaussian sequences, 

u n = C\j2 log(n)n _1//2 (28) 

with constant C is an accurate choice. Once it is guaranteed that C > sup /e - 01 - of, almost surely no 
increments of the continuous part of (20) are truncated. In practice, some suitable upper bound C for 
the volatility can be obtained from historical data. □ 

4. Asymptotic minimax-optimality results for the local change problem 
4.1. Consistency and minimax-optimal rate of convergence 

In this section it becomes important that stochastic squared volatility processes lie under Hq in 
£(a, L n ), defined in (2), where we take into account strictly positive increasing sequences L n . This is 
crucial as we cannot describe the random processes as members of a fixed Holder class. If of satisfies 

E[|of — off] < C\t — s| 7+tm for some b, C > 0 and 7 > 1 , 

then the Kolmogorov-Centsov Theorem implies that lim^^oo P( (o - /) o<t<i £ £(ct, L n )) = 1, pro¬ 
vided L n —y 00 arbitrarily slowly. Hence, up to a negligible set, X(a, L n ) contains the paths generated 
by a huge number of popular volatility models when considering L n —> 00 . On the other hand, if L is 
fixed, we are in the familiar framework of Holder classes. 

At this stage, we integrate alternatives where the volatility is less smooth than under the hypothesis, 
but which require not necessarily jumps. The statistical devices developed above may be applied 
to discriminate Hq from alternatives without jumps where until some change-point 6 e [0,1), the 
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process (< 7 2 A g) behaves as a process in £(a, L n ). After 9, the regularity exponent drops to some 
0 < a' < a. Since S(a, L n ) C T>(a',L n ), we require functions that ‘exploit their roughness’ in 
a certain sense. Processes in the alternative set may be smoother on parts of the interval, but we 
require that they ‘exploit their roughness’ somewhere on [ 0 , 1 ) (close to 6 respectively), such that in 
particular (of )te[o,i] 0 E(a, L n ) ■ To describe the alternative sets, define 

We then express the set of possible alternatives 


cR / 


(a, a , b n , L n { — | (ctAe) t£[o,i] ^ Tl(a , L n )| 


inf cr| > b r 

\h\<2k n A n 


or 


sup 

\h\<2k n A 1 


a 6 < — 


and consider the testing problem 


Ho : (of (w))te[o,i] e Ti(a, L n ) vs. Hf : 3 0 e [0,1) |(of (w))te[o,i] F <S^(a, a', b n , L n , k n ). (29) 

Since k n is selected dependent on L n and a, the dependence of Sjj on k n becomes redundant. De¬ 
manding the exceedance period of the difference quotient in Sf to be at least two block lengths ensures 
that our block-wise comparison in the test statistics (13), (15) is able to detect the roughness, also for 
9 = 0 . 

Let us elaborate on the specific form of the alternative sets. In general, it is impossible to test 
E(o, L n ) against Yi(a!,L n ) for a > a', and it is necessary to consider special subsets of T,(a',L n ). 
Intuitively, it is clear that one needs at least to remove £(a, L n ) from E(a 7 , L n ), but this is not suf¬ 
ficient. In fact, one needs to focus on the functions which exploit their roughness in a certain sense; 
cf. Hoffmann and Nickl (201 1) for a detailed discussion in a related context. Geometrically, this means 
that the functions of interest are those with discontinuities or rough behavior as characterized in Sf 
(or which fluctuate considerably more, like the ones considered in Section 5). However, as the sample 
size n grows, we only require the difference quotient to exceed a level b n that becomes smaller and 
smaller. 

For the testing problems (4) and (29), we first present a negative result, that also serves as a minimax 
lower bound for the problem depicted in ( 6 ). 

Theorem 4.1. Assume that a > a / > 0 and inf / aj > rr 2 > 0. Consider either set of hypotheses 
{Ho, or {H 0 ,H^}. Then for 


b n < (n/ log (m n )) 


2a+l 


{L n ) 


2a' + l „ 
2a+l 


(30) 


with a! = 0 for H\, we have in both cases lim n _ 5 . 00 inf,/, 7 ^(a, b n ) = 1. 

Theorem 4.1 reveals that it is impossible to construct a minimax-optimal test in the sense of ( 6 ) if 
b n is bounded as in (30). Consequently, we deduce that 


bT > (n/ log(m n )) 20+1 (L n ) 


2»' +1 0 
2o+l 


(31) 


In Theorem 4.2 we shall establish a corresponding upper bound up to a constant, and thus (31) already 
gives the optimal rate for the minimax distinguishable boundary. Observe that based on V* from (15), 
we can obtain the following test ?/>°. 


^{(XiA n )o<i<n) = 1, ifV* > 2CV21og (m«)/k*, 


(32) 
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where C 0 > 2 and k 0 = (y/log (m 0 )?i a /L n ) 2a+1 , m 0 n = [n/k 0 \. 


(33) 


Alternatively, one might base a test on V n from (13). 

To simplify the discussion, we restrict to positive volatility jumps, i.e. inf) A at > 0, which appears 
natural from an economic point of view. We point out that an analogue result can be shown for neg¬ 
ative, or positive and negative jumps, which however requires a further technical structural condition 
(that jumps do not cancel each other) in case of multiple jumps in a vicinity for the alternative set. 

Theorem 4.2. Consider (4) with infj Aay > 0, or (29) withO < a! < a < 1 and L n = o((n/k 0 ) a ~ a '). 


¥ 


K> 



n 


/log (m n )) 


2a+l 


(Ln) 


2a' + l 

2a+l 


(34) 


where k^, and C 0 are as in (33), then lim,^^ 7^,0 (a, 6/) = 0. This implies that 


b T (n / log(m n )) 


2a+l 


(L n ) 


2o' + l 

2o+l 


Remark 4.3. If L defined in (2) is a deterministic constant, we get the minimax distinguishable 

— (a—n') 

boundary b n oc (n/log(n)) □ 

Remark 4.4. Volatility paths lying in Sf having (locally) a rough - but still continuous - increase or 
decrease cannot be distinguished from volatility paths with jumps at or below the boundary b n stated in 
(30). Both alternatives H\ and Hf arc in that sense intimately connected. Formally, we cannot include 
Sq in setting a' = 0, since for Sf we demand the roughness to persist over an (asymptotically 
small) interval. 

Let us point out that in this testing problem the union of hypothesis and alternative can not cover 
the set of all possible volatility paths. One situation of interest in which of {5^ U Sq U £(a, L n )} 
is the case of fractional processes with Hurst parameter a' < a. This different situation is addressed 
in Section 5. □ 


4.2. Estimating the change-point 

Once one has opted to reject the null hypothesis of no change, the actual locations of jumps become 
of interest for further inference. This location problem has been extensively discussed in the literature 
in different frameworks; see for instance Csorgo and Horvath (1997) and Muller (1992). 

4.2.1. One change-point alternative 

First, we restrict ourselves to the ‘one change-point alternative’ involving a jump in the volatility, 
i.e. we specify the alternative hypothesis H * as 

H* : |og — of_ I =: 5 n > 0 for a unique 9 e (0,1). 

The jump size 5 n may be fixed or we consider a decreasing sequence ( d n ). To assess the possible time 
of change, we use slightly modified versions of the building blocks of the test statistic V* from (15), 
defined as 

i i+k„ 

E ri(A]X ) 2 - E n(A'jX ) 2 , 
j=i— k„+l j=i +1 


V 0 - = 

v n,x 
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for i = k n ...., n — k n , and Vf i = 0 else. In contrast to the construction of V*, we may employ a 
simpler unweighted version. One can also consider the rescaled versions as in V*, and we conjecture 
that the following theoretical results of these estimators coincide. Here, we switch from ratios to 
differences which simplifies the analysis a bit and is enough to obtain the following properties. The 
possible time of change is then estimated via 


n9 n = argmax,_ 


i=k n ,...,n—kn v n,i 


’ n 


(35) 


The following proposition establishes quantitative bounds for the quality of estimation. 

Proposition 4.5. Assume that the assumptions of Theorem 3.2 hold and that H* is valid. Then, for 
5n > 2fc" 1/2 v /M n) sup te r 0jl i erf, we have that 

(36) 

Remark 4.6. If 5 n does not tend to zero, the condition on 5 n in the proposition is always satisfied. 
The estimator extends to jumps of X using truncation as in (24), and Proposition 4.5 then applies to 
the generalized estimator under the assumptions of Proposition 3.5. In the setup of continuous breaks 
under alternative 5^ the same estimator is consistent (only) when except on a small interval around 9 
the volatility is ci-regular. When the maximal length of this interval is yjk n log(n) /( n5 n ) , (36) applies 
when we replace 5 n by d n (k n A n )“ . Clearly, when the volatility violates a-regularity over longer time 
horizons such a result is not available. □ 

Obviously, the quality of the estimator 6 n depends on the bandwidth k n , and the smaller, the better. 
This is the complete opposite case compared to the test based on statistic V*, where a larger choice 
of k n increases the power. This is no contradiction, since both problems have a different, essentially 
reciprocal nature. Also note that k n cannot be chosen arbitrarily small; see condition (16). 

While classical estimators as the argmax of statistic (8) attain a standard -fn- rate, corresponding 
to k n ~ n, our nonparametric localization approach readily facilitates improved convergence rates as 
known for state-of-the-art change-point estimators, as e.g. in Aue et al. (2009). The following propo¬ 
sition sheds light on optimal convergence rates for the estimation problem. 

_ 2 

Proposition 4.7. On the assumptions of Proposition 4.5 for k n oc (yTog(n)n a J 20+1 , a consistent 
estimator for 6 does not exist in the case that 5 n = o(y/\og(n)k n 1 ^ 2 ). 


4.2.2. Multiple change-point alternatives 

In the sequel, we demonstrate how the previous theory can be extended to multiple change points. To 
keep this exposition at a reasonable length, we focus on the alternative where the volatility exhibits 
jumps. From a general perspective, multiple change-point detection is typically a challenging multiple 
testing problem compared to the one change-point detection problem. The main probabilistic difficulty 
usually lies in controlling the overall stochastic error. Fortunately, in the present context we have 
already successfully dealt with the overall stochastic error, see Theorems 3.2 and 4.2. Thus, treating 
the multiple change-point problem only requires small adjustments. For N G IN, let 

0 < 6i < ... < On < 1, &j v = {0i, • • •, On}- 

We then consider the alternative 

H* : — cfq._ | =: 5 U)l >0 for 1 < i < N. 


(37) 
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The number of changes N in unknown to the experimenter. The goal is to provide uniformly consistent 
estimates for the multiple change points 9 6 0 y. To this end, given an index set X C { k n ,..., n—k n }, 
we define in analogy to (35) 

n9 n (X) = argmaXjgj . (38) 

Based on the test tff introduced in (32), we propose the following algorithm for multiple change- 
point detection. 

Algorithm 4.8. 

Initialize Set I = {k n ,..., n — k n }, 0 = 0 and select r n = o(n ) such that k n = o(r n ), k n —> oo. 

(i) If'il' 0 ((X,A n ) l( z±) = 0, stop and return X and 0. Otherwise go to step (ii). 

(ii) Estimate one time of change 9 using 9 n (X) from (38). 

(iii) Set X = X \ {[9 n (X)n\ - r n ,..., \9 n (X)n\ + r n }, 0 = 0 U {9 n (X)}, and go to step (i). 

Algorithm 4.8 is a sequential top-down algorithm, similar in spirit to the well-known bisection 
methods. Observe that it not only returns an estimate for the set of change-points 0 y, but also the set 
X of non-contaminated indices, which can be used for further inference. The following result provides 
consistency of the proposed set estimators. 

Proposition 4.9. On the assumptions of Theorem 3.2 and under Iff, if 

(i) for some N' = o(n/r n ), it holds that infi<j<Ar_i |#,:+i — 0* | > (TV 7 ) -1 , 

(ii) infi<j<7v S n ,i > 2kf 1/2 y/\og(n) sup te[01 ] of, 

then we have consistency ofQ, i.e.; ^(l©l = N) —> 1, and sup \9 n — 9 n \ > 0 . 

n=l,...,N 

Note that we can allow for increasing N = o(N') as the sample size n increases. The bound in 

_ 2 

condition (ii) is optimal if k n is selected in the optimal way k n oc (y^log^njn 11 ) 20+1 . 

5. A change-point test and asymptotic results for the global change problem 

In Section 3 we present methods to test the hypothesis of a-regular volatilities against local alterna¬ 
tives of less regular volatilities. An important example have been volatility jumps which violate the 
hypotheses for any a > 0. If the hypothesis is rejected for a pre-specified a, however, the test does 
not reveal if this is due to a volatility jump or due to a change of the regularity where the volatility 
is a-regular on [0, 9) and a'-regular on ( 9 ,1] with a! < a. In case of global changes the test (under 
certain conditions) rejects as well. Moreover, alternatives in Section 4 do not cover a change in the 
Hurst parameter of a fractional volatility process. The latter constitutes a different testing problem of 
great interest which is addressed in this section. We develop a new test which discriminates between 
volatilities which are a-regular on [0,1], except for a finite number of discontinuities, and volatilities 
where at 9 £ (0,1) the regularity exponent drops to a' < a such that a-regularity is permanently 
violated on [0,1]. We consider processes which satisfy the following regularity assumptions. 

Assumption 5.1. (i) Drift and volatility process in (1) are cddlag, inf /e p, [ of > 0. 

(ii) For a finite setT = {T|..... T ; y} of N < oc stopping times 

supj =1 ,...,7V \ a Tj ~ lim t<Tj,t->Tj <7i| < K with a constant K < oo. 

(iii) of = vt + Qt with the o-algebra o(q Si 0 < s < l) being independent ofo(W s , v Sl 0 < s < l). 
For any 0 < s, r < 1 with [s, r fl T = 0, we have with a constant K and some e > 0: 

(E[\u T -o s \ 8 ]) 1/8 <K\t-s\^ 2+ ^ . 


(39) 
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(iv) For A/p = n 


'(i~ i)A„ 


{q s — Qs-A n ) ds, it holds that 


2 sJE(( a M 2 -e[( a ?<?) 2 ]) 

“ “ i =2 


Op(y/n) . 


(40) 


Testing problem 5.2. Hypothesis: The process (p)t S [o,i] is a-regular in the following sense: 

E[(A» 2 ] = d 2 n + o(n- 1 / 2 ),forallfd 2 n < Kn~ 2a , (41) 

(E [(A/p) 8 ] ) 1/8 < K n~ a , for all i, (42) 

for some constant K and a > 0. Thus, on the hypothesis (er 2 )tg[o,i] is min(l/2, a)-regular. 
Alternative: For 6 e (0,1), (et/\o)t >o satisfies (41) and (42) on [0, 9). For some a! < min(l/2, a), 
b' n > 0 and for all iA n > 6, we have 

E[(A ?Q) 2 }>b' n n- 2a '. (43) 


Contrarily to the setup of Section 3, the hypothesis allows for a finite number of discontinuities in 
(^t)te[o,il- On the other hand, under the alternative the volatility permanently ‘exploits its roughness’ 
in the sense of violating (41) permanently over [6, 1], Condition (39) can be extended also to the 
case where u is an Ito semi-martingale with jumps of finite activity. Assumption 5.1 (iv) grants that 
E[(p* — p s ) 2 ] does not vary too much over time. This condition is obsolete for a > 1/4. 

Our setup covers many stochastic volatility models, and in particular it applies to discriminate frac¬ 
tional volatility models with different Hurst parameters. While most fractional stochastic volatility 
models include independence of log-price and volatility, possible dependence (leverage) is usually 
allowed in the literature using a semi-martingale volatility. In this light, the decomposition in As¬ 
sumption 5.1 appears natural, where (gt)t >o is independent of (Wt)t >o and (vf)t> o, whereas (vt)t>o 
comprises leverage. The following simple example reveals the interplay of Assumption 5.1 (iv) and 
(41) and the Hurst parameter. 

Example 5.3 (Fractional Brownian motion). Suppose that (g t ) is a fractional Brownian motion with 
Hurst parameter H. Let = n H (gkA„ ~ B(k- i)A„)> k > 1. Then, see e.g. Embrechts and Maejima 
( 2000 ), 

(i) Qt~ Bs = Qr-s = (r - s) H i i, r > s, 

(ii) |E[£o£fc] | < K (k + l) 2 ^ -2 , k > 1, for some constant K. 

The scaling property (i) implies (41) and (42) with H = a. If 1 /4 < H < 1, then (i) suffices also to 
guarantee the validity of Assumption 5.1 (iv). IfO < H < 1/2, then 


E 


max 
2 <m<n I 


£ (A“ e ) 2 -E[(A"£,) ; 


i=2 


< n 


-2 H 


E 


'[ 0 , 1 ]= 


max 
2 <m<n 


m— 1 

E 

k =1 




drds. 


Then (ii) together with the joint Gaussianity of (^h +r ffik+ s ) implies that (fk+rik+s) is a short memory 
sequence. In particular, using the results in Arcones ( 1994 ) and Moricz et al. ( 1982 ) yields that 

771—1 


n 2H sup 

0<r,s<l 


E 


max 

2<771<71 


(tk+rtk+s - E[^ +r 6+ s ]) |] = 0(n~ 2H+1 / 2 ) , 


k =1 


and hence the validity of Assumption 5.1 (iv). 
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More generally, our setup includes prominent realistic volatility models, such as fractional Ornstein- 
Uhlenbeck processes discussed in Comte and Renault (1998), also considered in Section 6. 

Because of the different permanent nature of the change, we derive a new statistical device to 
address Testing problem 5.2 which differs from the methods in Section 3. In particular, we propose a 
global cusum-type test statistic instead of localized ones. Define for i = 2,..., n: 

Qn,i = n 2 (((A”X) 2 - (A^X) 2 ) 2 - ^((A"X) 4 + (A”_ x X) 4 )) . (44) 

Our cusum-type test statistic based on the statistics (44) is 


V f = 


max 


m 


y/n — 1 m=2,...,n I fpf 


(^Qn,i 


xr =2 Qr 

n — 1 


(45) 


Intuitively, statistics Q nj are small if |of Aj — r> 2 {j | is small and become larger the larger |of A ; — 

a fi- i)A I - The regularity a thus directly influences the average behaviour of the Q nj and a change at 
time 6 can be detected by (45). 

Theorem 5.4. On the hypothesis of the Testing problem 5.2 and Assumption 5.1, the cusum-process 
associated with statistic (45) obeys the functional convergence 




<7 4 dB s — t <t 4 dB s I , 


(46) 


stable with respect to IF, weakly in the Skorokhod space, where (B s ) denotes a Brownian motion 
independent of T. 

As an immediate consequence of Theorem 5.4, we obtain 


— yt 
80 n 


v\ yi = sup 

0<t<l 


o-gdBs — t CTgdBs 


(47) 


In order to construct a test based on vj, the key object are the (conditional) quantiles 

q a (V t|jr) = inf{x > 0 : P(l/ t < x\F) > a). (48) 

The latter depend on the unknown volatility (crt) te [ 0 ,i], and are therefore not a priori available. One 
way to circumvent this problem is to locally estimate (ot) /e r 0 p and to apply an appropriate bootstrap 
procedure to approximate q a ( V' 1 \ F). Alternatively, a standardized version of (46) facilitates a test 
based on a Kolmogorov-Smirnov limit law which is given in Proposition 5.6 and requires a slightly 
stronger additional condition. For both, consider for K n oo, K n /n — > 0 


cr *A„ 


n 

3 K n 


Y. (Aj X) 4 , K n + 1 < * < 


n. 


(49) 


j=i-K„ 


For a sequence of i.i.d. standard normals 1 ,, denote with 


K y 1/2 

vj = sup | S n t tS n |, S nt =(Y L ) ^2 a 


0<t<l 


n 


(iK n +l)A n ^i 


Zi. 


(50) 


i— 1 
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Based on Vn, we construct the approximative (conditional) quantiles 

q a {V*\F) = inf{x > 0 : P(t> t < x\T) > a}. (51) 

We can compute q a (Vn \F) as accurately as we want using Monte Carlo approximations. Testing 
problem 5.2 is now addressed with the test: 


a„ )o<i<n) 


1 , ify/&VA>qi-aQti\F), 

0 , otherwise. 


(52) 


Observe that this test does not require any pre-specification of a, a'. It reacts to the change under the 
alternative for any a, a'. From a statistical perspective, it is important that 

(A) tpa (asymptotically) correctly controls the type I error under Hq, 

(B) tf’a provides optimal power (in minimax sense), 

established by Theorem 5.5 and Theorem 5.7. 

Theorem 5.5. Grant Assumption 5.1. Then, for any fixed a > 0, |P(V r ; < q a {y^\F)) — a\ —> 0. 

Before discussing property (B) of the test, let us first touch on a second approach that avoids a 
bootstrap. Slightly modified assumptions allow for the following standardized version of (45). 

Proposition 5.6. Grant Assumption 5.1 and the hypothesis of Testing problem 5.2 with a > 1/4. In 
this case (46) holds true. Moreover, (3/80) 1 / 2 V,t with 


V.J = n 1 / 2 max 

K n -\-l<m<n 


m 

2)A„) Qn,i 

i=K n -\-1 


n 

(' m/n ) (^fi- 2 )A n )~ 1 Qn,i 

i=I< n +l 


weakly converges to a Kolmogorov-Smirnov law and the associated limiting process in (46) becomes 
a standard Brownian bridge. 

Theorem 5.7. For this testing problem, the minimax distinguishable boundary satisfies 

b n oc (n~ 1/2+2a ' + n“ 2(a ~ a,) ) . (53) 


In particular, for b n = o(n + n “^) a consistent test does not exist: 

lim inf 7 ,/. (a, b n ) = 1 . 

n—foo ip ' 


Remark 5.8. The lower bound in Theorem 5 .7 reveals that detecting an alternative with “too smooth ” 
volatility, a! > 1 /4, is not possible in a high-frequency setting. □ 

The boundary b n in Theorem 5.7 is of slightly different nature than the one in Theorem 4.1 . Roughly 
speaking, the testing problem in Section 4.1 can be associated with a high-dimensional statistical 
experiment, whereas the one here is attached to a univariate statistical experiment. In this case, an 
optimal test fi> = f> a can only reach the lower bound up to a pre-specified nominal level 1 — a, 
0 < a < 1, see e.g. Ingster and Suslina (2003). Equivalently, we call a sequence of tests il: n minimax- 
optimal if for any b’ n with n - 1 / 2 + 2 °' -|_ n ~ 2 G~a') _ o(b' n ) 

Hm 70 (a, b' n ) = 0 . 


(54) 
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Proposition 5.9. The test ip' a in (52) is minimax-optimal. A corresponding test based on Proposition 

5.6 is also minimax-optimal if a > 1 /4. In particular, under the alternative of Testing problem 5.2 we 

+ P 

have Vn —> oo, when 


n 1 / 2 + 2a ' + n 2 (“ = o{b' n ). (55) 

Remark 5.10. Truncation in (45), analogously as in Section 3.2, facilitates a method robust to jumps 
of Xf . The time of change 6 can be estimated using the argmax. Precise results on the latter aspects 
are left for future research. □ 

We have established a minimax-optimal test for global changes. The methods from Section 3 react 
under some conditions also to global changes, but forfeit optimality. Combining both approaches 
provides the statistician with suitable devices to analyze volatility dynamics. 

6 . Simulations 

We examine finite-sample properties of the proposed methods in a simulation study. First, consider 
n = 1000 observations at regular times of ( 1 ) with a stochastic semi-martingale volatility model 

o t = (/ c-pdW s + J y/l - p 2 ■ cdwf^j ■v t (56) 

which fluctuates around a deterministic seasonality function 

Vt = 1 — 0.2 sin (|vrf), t 6 [0,1], (57) 

with c = 0.1 and p = 0.5, where 1C is a standard Brownian motion independent of W. We set the 
start value Xq = 4 and the constant drift a = 0.1. (57) mimics a realistic volatility shape with strong 
decrease after opening and slight increase before closing and the model poses an intricate setup to 
discriminate jumps from continuous motion based on the n = 1000 discrete recordings of X. 

Under the local alternative, we add one jump of size 0.2 at fixed time t = 2/3 to at, which equals 
the range of the continuous movement and shifts the volatility back to its maximum start value. This is 
in line with effects evoked by surprise elements from macroeconomic news in the financial context; see 
for instance Figure 1. Changing the time of the volatility jump does not affect the results substantially, 
though. One jump of X at a uniformly drawn jump arrival time is implemented for the hypothesis and 
the alternative as well, and under the alternative X additionally exhibits a common jump of X and a 
at t = 2/3. All these jumps are AV(0.5,0.1) distributed. 

Since X comprises jumps, we apply the test statistic (24). We focus on V* u with overlapping 
blocks as it significantly outperforms the test with non-overlapping blocks. For the truncation se¬ 
quence we set u n = y/2 log (n)n -1//2 ~ 3.72 n -1 / 2 ; see Remark 3.7. In all cases, we iterate 10,000 
Monte Carlo runs. 

Figure 3 illustrates simulation results for A:i ooo = 275, but minor modifications of k n do not change 
the results substantially. Null and alternative are reasonably well distinguished, but the approximation 
of the limit law is somewhat imprecise, which is typical for limit theorems with extreme value dis¬ 
tributions in finite-sample applications. Therefore, it is common practice in change-point literature to 
apply bootstrap procedures; see e.g. Wu and Zhao (2007). We apply here a wild bootstrap-type pro¬ 
cedure and use the statistics (23) to pre-estimate the (in practice) unknown volatility and a smoothed 
version applying a linear filter with equal weights and k n lags to derive an estimated volatility shape. 
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FIGURE 3. Top: Histograms of (24) for fciooo = 275 under hypothesis and alternative (right) and rescaled version compar¬ 
ing left hand side and limit law of (27) and bootstrapped law (left); limit law density marked by dotted and bootstrapped 
by solid line. Bottom: Empirical size (left) and power (right) of the test by comparing empirical percentiles to ones of limit 
law under Ho (light points) and to the bootstrapped percentiles (dark points). 


Then the statistics (15) are iteratively simulated, with X being a discretized Ito process without jumps 
and drift and with the pre-estimated volatility, to obtain critical values from the bootstrapped distri¬ 
bution. In light of the intricate setup. Figure 3 confirms a high finite-sample accuracy. The density 
curve of the bootstrapped law in Figure 3 is obtained from a kernel density estimate with R’s standard 
bandwidth selection using Silverman’s rule of thumb. While the size based on critical values from the 
limit law is not so precise, the bootstrapping works considerably well. The power for typical testing 
levels is reasonably large. 

If one is interested in the approximation of the critical values by the limit law, Figure 5 provides 
more insight. Here, we have visualized the simulation results for n = 10, 000 observations, using 
different block widths. Setting fcioooo = 500, the limiting law is very well approximated and the 
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power looks good as well. However, similar effects as before can be seen for a too large choice of k n . 
Therefore it appears reasonable to use a bootstrap procedure in such a setting as well. 

In the financial literature, many stochastic volatility models rely on fractional non-semi-martingale 
processes and recently in particular the interest in volatilities with small regularity has increased; see 
e.g. Gatheral et al. (2014). To see how our methods perform in such models, we modify our setup 
using the prominent fractional log-volatility model by Comte and Renault (1998); i.e. replacing the 
semi-martingale above by a fractional OU-process 

d(log(er t )) = -0.1 log(dt) dt + 0.1 dBf , a t = a t ■ v t (58) 

with a fractional Brownian motion (B[ r )a<t<\ with Hurst parameter H. The fractional process is im¬ 
plemented following Choleski’s method with a code similar to the one in Appendix A.3 of Coeurjolly 
( 2000 ). 

The upper part of Figure 4 presents the finite-sample precision of the test for a small Hurst param¬ 
eter H = 0.2. The outcomes are only slightly less accurate than for semi-martingale volatility and 
broadly give a similar picture. Therefore there is even finite-sample precision for detecting jumps in 
a fractional volatility process with a small Hurst parameter. Coming back to our introductory data 
example from Figure 2 for intra-day prices on March 18th, 2009, the test rejects the null for both 3M 
and GE with p-values very close to zero. The point in time where the difference of adjacent statistics 
is maximized estimates the time of the structural change under the alternative. In both examples we 
find grid point 285, corresponding to 02:15 p.m. EST, as the estimated change-point. 

Finally, we examine the test for global changes based on (45) in a simulation. Thereto, consider 
hypothesis (56) for the volatility against the alternative that (crt)t>6 follows (58) with Hurst parameter 
H = 0.15, where under the alternative the change in smoothness happens at 6 = 0.5; see Figure 
6 for an illustration. The lower part of Figure 4 confirms a remarkable finite sample performance of 
this test. Modifications of H under the alternative do not affect the results substantially. Therefore, 
we expect that the methodology in Section 5 opens up valuable new ways for inference on volatility’s 
regularity, useful for studies as the one in Gatheral et al. (2014). 
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Testing fractional OU log-volatility against jump 




Testing semi-martingale against fractional volatility 




FIGURE 4. Empirical size (left) and power (right) of the tests by comparing empirical percentiles to ones of the limit law 
under Hq (light points) and to the bootstrapped ones (dark points). Top: Test (24) with fciooo = 275; bottom: Test (45). 






















Bibinger, Jirak, Vetter/Nonparametric change-point analysis of volatility 


22 





FIGURE 5. Finite sample precision of nonparametric test (15 1 for volatility model (57) against jump under the alternative 
for sample size n = 10, 000. 

Top: Histograms of (15) for fcioooo = 500 under hypothesis and alternative (right) and rescaled version comparing left 
hand side and limit law of (18) (left): limit law density marked by solid line. 

Bottom: Empirical size and power of the test by comparing empirical percentiles to ones of limit law under Ho for fcioooo = 
500 (left) and fcioooo = 1000 (right). 




FIGURE 6. Test of semi-martingale against fractional volatility: Typical paths of the log-price under the alternative, left, 
and of the volatility under null (dots) and under alternative (lines), right. 
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Appendix A: Proof of Theorem 3.2 


First, we reduce the proof of Theorem 3.2 to Propositions A.1-A.5. The main part in the analysis of 
V n from (13) is to replace it by the statistic 


U n — max 1|> 

|n/fc n J —2 

in which the original statistics RV n y from (14) are approximated by 


(59) 


Y n.i “ F X] 

71 j = 1 


(60) 


Up to different (random) factors in front, the maximum in U n is constructed from functionals of the 
i.i.d. increments of Brownian motion, which helps a lot in the derivation of its asymptotic behaviour. 
We start with a result on the approximation error due to replacing V n by U n . 

Proposition A.l. Suppose that we are under the null. If Assumption 3.1 and (16) hold, then we have 

y/log (n) kn (Vn - Un) A 0. 

Recall that the variables Y n y are not only computed over different intervals, but come with different 
volatilities in front as well. In order to obtain a statistic which is independent of a let us define 

bn 

Y = r E i.+jW') 2 , <(>1) 


3=1 

where the volatility factor is shifted in time now. Set then 

U n — max | j_|_i 1|. (62) 

i=0,...,\n/k n \ — 2 

Proposition A.2. Suppose that we are under the null. If Assumption 3.1 and (16) hold, then we have 

yiog (n)k n (i U n - U n ) A 0. 

In the final step, we replace Y n ^ + \ in the denominator by its limit cr 2 k ^ . Set 


V n = max 

i=0,...,[n/k n \— 2 


^71,2 ^71,2+1 


2 

G Ah. A 
trcri 


(63) 


Proposition A.3. Suppose that we are under the null. If condition (16) is satisfied, then we have 

V^g W kn (i U n - V n ) A 0. 


From Propositions A.l to A.3 we have ^/log (n) k n ( V n 




0 , while 


V n = max 

i=0,...,\n/k n \ —2 


kn -i k n 

( y/nA™ kn + j W ) 2 ~ tt 


(i+1 )k n ~\~j 


wf 


3=1 


3=1 


(64) 


This statistic corresponds to the statistic D n given in (13) of Wu and Zhao (2007); see as well Propo¬ 
sition A.5. Precisely, after subtracting the mean on both sides above, their (X/, : )i</.< n correspond to 
{[y/n/SlfW) 2 ) — l) , < , which forms an i.i.d sequence of shifted Xi-variables. 

In the same fashion we can prove that the asymptotics of Vf in (15) can be traced back to the 
statistics D* in (12) of Wu and Zhao (2007), see Proposition A.5. 
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Proposition A.4. We have that y^log (n) kn {V* - V*) A 0 , with 


V* = max 

i=k n ,...,n—k n 


i+h 


r E ((V^p'0 2 - (VHA"-*,W'T) 


J=*+l 


(65) 


Theorem 1 of Wu and Zhao (2007) establishes limit theorems of the form (17) and (18) under more 
restrictive assertions on k n than (16), as they consider the behavior for a class of weakly dependent 
random sequences (Xfc)fc>i- The next proposition provides a more specific limit theorem tailored 
to the asymptotic analysis of the statistics (64) and (65). In particular, instead of using the strong 
approximation theory under weak dependence from Wu (2007) employed by Wu and Zhao (2007) 
to prove their Theorem 1, we rely on classical bounds for the approximation error in the invariance 
principle for i.i.d. variables with existing moments. This is applicable in a more general setup with 
much smaller block lengths k n . 

Proposition A.5. Consider a sequence (Xk)k£ft of i.i.d. random variables with Var [X^] = f 2 and 
mx k \ p ] < oo for some p > 4. If 

kf p/2 n = o((log(n))~ p/2 ), (66) 

then with m n = [n/k n J the statistic 


D 


* 

n 


1 

-— max 

kyi kpi < ^- 7 ^ 71 kyi 


k n +i 

E 


j=i+1 


E x t 

j=i—k „+1 


obeys the weak convergence: 

\/ log (m n )( k+ 2 <X 1 ) D* n - 2 log (m n ) - ^ log log (m n ) - log3 A V , 
where V is distributed according to (19). The statistic 


Dn l<t<“j -2 I ^ Xikn+j I 

j= 1 

using non-overlapping blocks satisfies under the same assumptions 

- [4 log(m n ) - 21og(log(m n ))] 1/2 ) ^4 V . 

As all moments of the Xi distribution exist and k n is at least of polynomial growth in n, Proposition 
A.5 applied to (64) and (65) implies Theorem 3.2. We start with the proof of Proposition A.5 and then 
show by proving Propositions A.1-A.4 that the preliminary reductions are in order. 

Proof of Proposition A.5. By a simple rescaling argument, we can restrict ourselves to the case 
Var[Xfc] = 1. The Donsker-Prokhorov invariance principle guarantees weak convergence of partial 
sums of (Xfc)fceiN, rescaled with yfn, to the law of the standard Brownian motion as n -» oo. Let 
M be a sequence of centered i.i.d. Gaussian random variables with E [Zj] = E [Xj] = 1. 
Observe that 


max 

k n <i<n—kn 


kn+i 

E x - 2,) 


j=i +1 


E ( x i~ z i) 

j=i—k n +1 


i 


< 4 max 

k n <i<n 


E ( x i~ z i) 

4=1 
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We exploit the classical theory on bounds for the approximation error of partial sums of the above 
type associated with the invariance principle provided by the seminal works of Komlos et al. (1975), 
Komlos et al. (1976), Zaitsev (1987), and related literature. Let (rc n ) ne w be a sequence with x n > 0 
for all n. According to Theorem 4 of Komlos et al. (1976) or equivalently (1.6) of Sakhanenko (1996) 
with Markov inequality, the sequence (Zj) . g]N can be constructed in such a manner that 


max 

k n <i<n 


i \ 1 n 

'EiXj-Zj) >x n ) <cw£Epx 


3 =1 


3= 1 


<c 2 4 


with constants C \. C 2 which may depend on p. Selecting x n = y/k^5 n with 5 n = (log(n)) -1 / 2 , the 
conditions to apply Theorem 4 of Komlos et al. (1976) are in order and we get from condition (66) 
and the above that 


k n +i 

max ^ ~ Z i 

j=i +1 


£ (Xj-Zj) 

j=i—k n +1 


Op(y/k^(log(n)) 1/2 ). 


(67) 


Denote with B (k) = Y^j= 1 and define 

H(u) = (1(0 < u < 1) - 1(-1 < u < 0)) /V2. 


Then by (67), it follows that 


VKK 

V2 


\/2k n k n <i<n—k 


max |B(i + k n ) — 2B(i) + B(i — k n ) | + 




I O(Rn) 
y/K 


Op(l) 
\/ 10 g( n ) 
Op(l) 
\/log (n) ’ 


where 72 n = sup{|B(u) — B(tt')| : G [0,n], — n'| < l} = Op(^/log(n)) by standard 

properties of Brownian motion. Then, since (log(n)) 6 = o(k n ) by condition (66), we may apply the 
limit theorem from Lemma 2 in Wu and Zhao (2007) with a = 1, Dh,i = 3 and b n = rnf 1 for 
m n = (n/ Ay,j ; see Definition 1 and Lemma 2 of Wu and Zhao (2007). In the same manner, we can 
use that 



j=i +1 


i \ / 

y! ^ 

j=?-fc„+l / \j=i+l 


- X! Z ij + c ’ p (v /l °s( n ))’ 

j = i~ fcn + 1 / 


for E[26 2 ] = 1 with (Zj) . g]N again a sequence of centered i.i.d. Gaussian random variables. Lemma 
1 of Wu and Zhao (2007) then ensures the limit theorem for non-overlapping blocks. This completes 
the proof of Proposition A.5. □ 


Proof of Proposition A.l. First, a standard argument as e.g. laid out in Section 4.4.1 in Jacod and 
Protter (2012) allows us to strengthen Assumption 3.1 and to assume that all local conditions are in 
fact global. That is, we assume without loss of generality that |a s | < A', 0 < a 2 < n 2 < K and 
'wsict) 1 < Kb a for a generic constant K. 

Let (aj)j=and ((>/)?;= 1 .....rr, be arbitrary reals. Obviously, for an arbitrary i we have 
|ai| < |cti — bi\ + \bi\ < max |a* — b{\ + max \bi\. 
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Therefore the inequality holds with the left hand side replaced with maxj = i v .. m a, as well. Applied 
to V n and U n it is simple to deduce 


< 

max 

2=0,. 

...,|n/fc n J—2 

< 

max 

2=0,. 

...,|n/fc n J—2 


( 68 ) 


RVn 


RVr 


n,i +1 ^n, 2 +1 


+ max 
i=0,..., \n/k n \ —2 


RVn.i Y n ; 


Y r 


n,i+l 


Let us begin with the second term on the right hand side above. For all e > 0 and all constants D > 0, 
we have 


max 

i=t),...,\n/k n \-2 


y/k n log(n) ( ll\ /T v j Y n i 


Y n f +1 


> e 


< p | 

max 


\i=0,...,ln/k n \—2 

< p | 

max 




(69) 

To keep the notation readable, here and below we use standard probabilities and expectations without 
an extra indication that we are on the set Q c . 

Since we have of > cr 2 _ > 0, we can use the same arguments as in the proof of equation (22) in 
Vetter (2012) to derive 


max l/|V„,i+i| > D = 
j=0,...,Ln/fe n J—2 


min |y„,i+i| < D I -T 0 

i=0,...,\n/k n \ — 2 


with e.g. D~ 1 = The intuition behind this result is that the probability of a mean of k n i.i.d. 

variables with all moments deviating too much from its expectation becomes exponentially small in 
k n . A similar argument will be given in (77) later. Also, here we need that k n is of (at least) polynomial 
growth, which is included in (16). 

On the other hand, using Ito formula we obtain 

^k n \og{n){RV n , - Y nti ) = g {(£? kn+j X) 2 - a 2 knAn (X? kn+j W ) 2 ) (70) 

V^n - =1 


n- v /log(n) 


y/kn 


/ k 

r (ikn+j) An 


r(ik n +j)A n \ 

E 2 

/ (X s - 

- x (ik n +j-i)A n )a s ds + y J 

/ (°l-°k,A r ,)ds\ 

\j= i j 

'(ik n +j— 1)A„ 

j= i' 

'(ikn+j— 1)A„ J 


+ 


n 




n) 


n r (ikn+j) A„ 


£2 


y/kn ■ 1 J (ikn+j— 1)A, 


((X s X( ikn+ j_ 1 ' )An )<T s (W s W( ikn+ j_i-) An )a iknAn ) dW s . 


Using this decomposition, we split the discussion of 


( max y/k n \og(n)\RV n> i - Y n j | > e/D ) 
\i=0,...,\n/k n ]—2 J 


into three parts. For the first term, observe that 

Li r(ik n +j) A 


n 


max 


V^og( 


n) 


i — 0,...,|n/fc n J 2 yj k n 


I i X s x (ik„+j—i)A n ) a s ds >e/(3 D) 

j =1 J (ikn+j- 1)A 
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\n/k n \ -2 


< 


E 

i =0 


n 


y/W 


n) 


y/^n 
\n/k n \ —2 


E 2 


n (ik„+j) An 


=1 J(ik n +j- 1)A, 


(-^s i)A n ) a sds > e/(3D) 


<(e/(3D))~ r E 


i =0 


n 


a™ 

/ , ^ / (Xs ^(ifcn+j-l)A„) a s 


y/kn 


j—l J(ik„+j— 1)A„ 


for all integers r. Applying a standard bound based on Jensen’s and Minkowski’s inequalities yields 


E 


n^logln) 


'•(ifcn+j)A„ 


n — / (-^s ^(ifc n +j-l)A n ) a s 

V^n J{ikn+j— 1)A„ 


< A' r 


n 


V^og( 


n 


yfkn 


jfikn+j) An 

'(ikn+j- 1)A, 


E[|x s -x (ifcn+i _ 1)An n i/r d S 


K r here and below denotes a generic constant depending on r. Burkholder-Davis-Gundy inequality 
gives for any s € [{ik n + j - l)A n , (ik n + j)A n \: 

n\Xs - X {ikn+j _ 1)A J] < K r n~ r / 2 , 


E 


V /r - / (X, - X (ifen+j _ 1)An )a s ds < K r (nkn)~ r/ 2 log r/2 (n) 

V&n ^(i/cn+j—l)A n J 


We conclude that 


max 


77-a 77,^ I \^ n -\ /*(ifcn,+j)A n 

V u 1 E 2 / (*- - ^(ifcn+j-tJAja. <M > e/(3-D) ) (71) 


z—0,...,|n/fc n J 2 \/ kn 

< (e/ (3D))~ r \n/k n \K r k r J 2 n~ r ^ 2 log r ^ 2 (n) 0 

for r > 2 arbitrary. Regarding the second term in (70), on (A we have 

ny / log(n 


max 

i=0,...,|n/fc n J — 2 


r(ik n +j)A n 


E 


< 


max 


v^’fi J =1 7(jfc n +j —l)A n 
ny/k^n) ^ [(ikn+j )An 


- ^aJ ^ 


i—0,...,Ln/fc„J 2 y/k n j=l (Ikn+j—l)A r 


k s 2 -^LaJ^ 


< y/knWknA n (k) 1 \/l°g(n) < K y/kn{k n lA n ) a yJ\og(n ), 

which converges to zero by (16). Observe that addends above involve interlacing time intervals such 
that for a a continuous Ito semi-martingale the bound above applies with a = 1/2 and is sharp. 
Finally, we have the further decomposition 


(Xs -^(ifc„+J-l)A n) a s (W. W(ikn+j-l)A n ) a ik n A n — a s 


+ (°s ^iknAn) / dW^ + CTik n A n / 

j (ikn+j— 1)A„ 7 (ikn+j — 1) A 

We proceed in a similar way as above: 

kn "(ikn+j) An rs 


' a u du (72) 

(ifcn+J-l)A„ 

(ku <7"ifc n An) dW u . 


n 


x/iogC 


n 


max — ’ — | 2 / (j s I 

i=0,...,|n/fc„J-2 vA: n ^ J (ikn+j—l) An J (ik n +j— 1)A. 


a,,dudW q 


> e/(9D) 
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<( E /( 90 )r £ E 


[n/kn J 2 r __ /i_ k„ p(ik n +j)A r 


i =0 


nsj log(ra) 


E 2 


(To 


V^n ■ 1 7(ifc n +j-l)A n J(ik„+j— 1)A, 


a v du dWc 


Precisely, let r = 2m and set 


c, = > cr. 


E' 


J =1 */ (ikn+j— 1)A„ 

Then we have in a similar way as before 

kn r(ikn+j) An 


a u d'nl[(i/ Cn +j_i)A n ,(ifc„+j) A„) ( s ) 


E 


ny / log(n) 


' • V2 a s I 

\k n =1 J{ik n +j- 1)A„ J (ik„+j— 1)A 


a„ du dW7 


2mi 


_22m 

_22m 

< -Ff m 


n 


2m 


y / log(n) 

y/kn 

n^/log(n)\ 2m 


y/ kn 

n v /log(n) 

y/T^n 


'*(2+1) Ain An 


Co dWo 


2m- 


iAin An 
2m / /*(i+l)AinAr, 


iAln An 


(i-)-1) Ain An ^ 777,- 

c 2 s ds 


E [ c 2m] 1/ m ds V m _ 


With 


(73) 


E[c^] = ^E 

j=l 




2m 


\ 2m- 


{ikn+j— 1)A„ 


a u duj 


\{ik„+j— l)A„,(ifc n +j)A n )( s ) E 7\ m A n , 


we obtain 



= log 2m (n). 


By choosing m large enough, the term in (73) converges to zero. Similarly, 
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< K m r-^W ) 2m ( f (l+1)knAn (( kn A n ) 2ma A™) 1/m ds) m < K m (k n A n ) 2ma . 

J ikn An 


\/ h'n 


The same upper bound is obtained for the third term in (72). Again, choosing m large enough yields 
convergence to zero. Altogether, we conclude 

P f max \/k n \og(n)\RV ni i - Y n ^\ > e/Dj ->• 0, 

\i=0, ...,[n/k n ]—2 J 

and we are done with the second term on the right hand side of (68). Next, consider the first term on 
the right hand side of (68). For any e > 0 and any D > 0: 


max y/ k n log(n) | RV n , L 

i=0,...,ln/k „\—2 \ RVn.i-v 1 Y r 


1 


1 


'n,i -\-1 1 


> e 


< 


max yjk n log(n)\RV n j(Y nti+ i - RV n ,i+\)\ > e/D ) 
i=0,...,|n/A: n J—2 J 


+ IP I max l/\Y n , i+1 RV n , i+1 \ > D 


Observe that 

min | ^ D 

K i=D,...fn/k n \— 2 

< P ( min |y n , i+ i| < -D -1/2 ) +f( min \RV n , i+l \ < D~ 1 ^ 2 \ 

\i=0,...,\n/k n \—2 J \i=0,...,Ln/fe n J—2 J 

< P f min |Y^j + i| < D~ 1 ^ 2 ] + P ( min < 2 D~ 1 ^ 2 

yi=0,...,Ln/fcnJ~2 ’ J \i=0,...,Ln/fc n J—2 


+ P ( max \RV n i+i - Y n ,i + i\ > D 1/2 ) . 

\i=0,...,|_n/fc n J — 2 J 


All three terms on the right hand side have already been discussed above for an appropriate choice of 
D, the latter term even with an additional factor yjk n log(n). Similarly, for all F > 0 we have 


max yjk n log(n) | RV n j(Y n ,i +1 RY n ,i+l)\ > t/D 

i=0,...,|n/fc n J — 2 


<PI max \RV n i\ > T ) +P ( max yf k n \og{n)\Y ni+ i - RV n ,i+i\ > e/(DT) 

K i=0,...,[n/k n J-2 1 J \i=0,...,ln/k n ]-2 V 

Here we only have to focus on the first term, for which we use 


max | RV n * | > T 

i=Q,...,\n/k n \—2 


< P I max \Y ni \ > T/2 + P max \Y n t+l - RV ni+ 1 | > T/2 ) . 
y i=0,...,[n/k n ]-2 J \i=0,...,[n/k n ]—2 ’ ’ ' 


(75) 


The same arguments which were leading to equation (22) in Vetter (2012) show that the first proba¬ 
bility becomes arbitrarily small for large enough T, this time because we may assume a is bounded 
from above. The second probability has already been discussed above. □ 
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Proof of Proposition A.2. We have to show convergence in probability to zero of 

y/k n log(n) Y n i (Y n ,i-\-l Pn,4+1) 


■Jk n log(n) max 

j=0,...,Ln/fe„j—2 


Y Y 

1 72,2 n,i 


^ 72 , 2 +1 


72,2+1 


max 

2=0,...,|_n//c n J — 2 


^72,2+1 ^72,2+1 


Using equation (22) in Vetter (2012) again, we may focus on the numerator above only, and for the 
same reason as in (75) it suffices to prove convergence to zero of 


( max y/k n log(n)|l^ i+i - Y Uti+ 1 | > e ) 
V*=o,...,Ln/fcnJ-2 v ’ J 


(76) 


= r (.,= 0 „-Slj- 2 7*nlog(n)|4»i„ - I Vn 


> e 


< 


P ( max \/k n log(n)|<x 
\i=0,...,|n/fc n J-2 v 1 ' 


ik n An a (i+l)k n A n | - > 


+ r | max 

j=0,...,Ln/fc n J—2 


7=1 


> 2 


for all e > 0. Regarding the first quantity, recall that on f! c by (16) 

max Jk n log(n)\af knAn ~ af i+1)knAn \ < y/k n log(n)w knAn (^)i 

2—U,..., 1^72 /Z 

< Ky/k n {k n /S. n ) a yJ\og{n) ->• 0 . 

On the other hand, 


max 

i=0,...,Ln/fe?iJ—2 


n+logfa) 


k>, 




> 2 


7=1 


|n/fc„J-2 /,- 7 ^ fe 

n\/loe [n 


£ E 

i=0 

|_ra/A: n J—2 

< E p 

4 = 0 

[n/fcnj —2 

< E E 

2—0 


kn 

V^ogC 


n 


kn 

\/iog( 


71 j 


kn 


E< A (< + 1 )k, +i W? >2 

7=1 

kn 

■ E ((^s+ut+juo 2 -1) 

7=1 

kn 

E ((v / " A ”i + i) fc , + , f wo 2 -1) 

7=i 


> 1 


2mi 


(77) 


for all integers m. Due to the i.i.d. structure, the latter term is bounded by K m (n/k n )k n m log m (n), 
which converges to zero for m large enough. □ 


Proof of Proposition A.3. We have to show convergence to zero in probability of 


Ynf ^n,i +1 ^ n,i ^n,i+1 


\/k n log(n) max 

i=0,..., \n/k n \ —2 


Y r 


72,2+1 


2 

& a 

u\,n ‘-*n 












































Bibinger, Jirak, Vetter/Nonparametric change-point analysis of volatility 


31 


= sj k n log(n) max 

i=0,...,\n/k n ]—2 


( Y n,i Y n,i+l) ( Y n,i+1 ^ A„) 


Y n,i+l a ik n A n 


It is sufficient to focus on the numerator, and we discuss two terms separately, using 

F ( max, \]k n \og(n)\{Y n ,i - Y n}i+l )(Y n , i+1 - cr 2 fenA J| > e ) 
\Jk n log(n) 11 n yi Y nti+1 \ > \/e^ 


< r I max 

^i=0,...,[n/k„\-2 


+ p l™l, \ Y nf+l - °ikn A„l > 


The first term has already been discussed in (76), while 


. _ ™ . J Y nj+l - °ik n A n \ > 
*=0,...,Ln/K n J—2 


< 


i=0,...,Lro/fc n J— 2 


max 
i=0,...,Ln/fc n J—2 


£((^ A <m>*„ +J -WT - 1) > ^ 


3 =1 


-j k n 

7tE((^ A “, + 1)t „ +i FF) 2 - 1) > Vi/^ 


J=1 


using cr 2 < A'. The claim follows from (77). 


□ 


Proof of Proposition A.4. For the test statistic V') from (15) our proof follows the same stages as 
the one for V n via Propositions A.l, A.2 and A.3 above. We start proving 


Vkn log(n) {V: - 17*) A 0 


for 


(7* = max 

i=k n ,...,n—k n 


KU + i^W 2 




-1 


Similar to (68), we find that 


K*-^l<. max 

i=k n ,...,n—k r 


i / i+kn i 

E < A A) 2 (( E (A”W)" - ( E 

j=i—fc„+l v j=i+l j=i+l 

EAt +1 « A ?*> 2 - 4-MA„( A i w > 2 ) 


-1 


n 


+ max 

. £I%3i i <SL.&WY 

Following an inequality analogous to (69), the key step is to show that 


n)k n max 

i=kn,...,n—kn 


n 

hr 


E (( A " X ) 2 -4-K)A,(^Wf)\> e /D)^0, (78) 


j=i—k n +1 
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while for ay bounded from below we readily obtain 


n 


mm 

i hji ,... ,72— fcjj 


i+k n 

E 

j=i +1 


2 

°7a„ 


(AW) ! 


< D 


-i 


0. 


For the proof of (78) we proceed with a decomposition analogous to (70) and for the first term along 
the same lines as above leading to (71). However, the maximum extends now over the larger set of 
all indices i = k n ,... ,n — k n , and thus instead of (71) the upper bound yields (e/ (?>D))~ r n 1 ~ 5 Ay/ 2 
log r/2 (n), which is a factor k n larger than above. Still, choosing r sufficiently large the term tends 
to zero. The same reasoning applies to all terms for which we have used Jensen’s and generalized 
Minkowski's inequalities above as (74). 

Upper bounds exploiting the smoothness of the volatility remain as before, for instance 


y/log (n)k n max 

i=k„,...,n—k„ 


Jyn f(j+i+k„-l)A n 

r E/. . Jds 

71 j— 1 J l)A n 


< K- 


r ny/\og(n) 


s/k n 


Y^(jA n ) a Ar 


3 = 1 


< Ky/k^(k n A n ) a ^log(n) 


with a constant I\ on Assumption 3.1. In this fashion, all terms generalizing the expressions in the 
proofs of Propositions A. 1, A.2 and A.3 are controlled and we conclude Proposition A.4. □ 


Appendix B: Proof of Proposition 3.5 

Recall the definition of the general Ito semi-martingale in (20). Again, by the usual localization pro¬ 
cedure, we can work under the reinforced assumption that the process X t and its jumps AX t are 
bounded as well. We will then work with the decomposition Xt = Xq + Ct + Jt, where Jt denotes 
the pure jump martingale 


Jt = 


5{s, x)((A — u)(ds , dx) 


0 J R 


and the continuous part becomes 


Ct — / (i s ds T / & s dW s 
Jo Jo 


with a s = a s + f R k(S(s, x))X( dx). The latter integral is finite for bounded jumps. 
We shall prove only (27) of Proposition 3.5 by showing that 


\Jk n log(n) 


max 

i=k n ,...,n—k n 


ST X}=i-fc n +l( A " A 1 2 ] 1 {|A« Y|<« n } 




-1 


max 

i=kn,-",n—k n 


lEU- kn+ i(^c) 


- 1 


0. (79) 


^E^i(A^) 

Following a decomposition of the eiTor term as in (68), we have to show that 

^Y, l j=i-k n +l( A j X ) 2l {\A^X\<u n } |(X}=i-fc n +l( A "A6) 2 l{|A"X|< M „} 


y/k n log(n) max 

i=k n ,...,n—h 


£Ej;hi( A " x ) 2i {iA-xi<»„} 
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+ y/k n log(n) max 

i=k n ,...,n-k n 


£E‘ =i - fc+ i - (A?C) 2 ) 


0. 


Both terms are handled similarly and we restrict to the second one. It suffices to prove that 

ny / log(n) 

j=z-fc n +l 


max 


— k n yj 


E ((A"A) 2 l { |i ? x|<„») - (A”C) 2 ) 


>- ^0, (80) 


for all e > 0 and constants D > 0, as (79) then follows with (69) and the same bound for the second 
probability as in the proof of Proposition A. 1. As maxi A''C' = o n . s .()j ri ) by basic extreme 
value theory we can work on a subset of 0 where maxi<j< n | A"C| = o(u n ). Observe that on this 
subset 


max 

i=k n ,...,n—k n 


E (A”X) 2 1 {|a ,. x| <„j - (A”C) 2 ) 


j=i-k n +1 


<K max ( V l,| i?A | > „. ) (A”C) 2 + ((| A” J| A »„) 2 + {| A” J\ A U „)| A”C|)) 

i — k n ,...,7i k n \ # J ' 

J=Z-/C n + l J=7-/c n + l 

with some constant K. 

Pertaining the first addend and using maXj(A-'C) 2 = Op(A n log(n)), we have to ensure that 

i 

max E 1{|A"A'| >u n } = o P (v^/log 3 / 2 (n)) . 

i — K n ,...,n k n . . J 

j=i-k n +1 

Let p with 1 < p < (2rr) _1 be arbitrary. We use the decomposition X = X' n + X" n with 


X t n = 



0 7R 


5(s, x)l Mx)>u r } v(ds, dx), X t n = X t - X t n , 


and define A” = {| A'-X n \ < u n / 2}. Finally, N" is the counting process 


K = 


1 { 1 {x)>ul}Vi ds i dx )- 


/0 JR 

We know from (13.1.10) in Jacod and Protter (2012) that 


E 


u=fc“ a n-fc n S l {\^x\>u n} i m n < E p ((^ n ) c ) ^ 0 

j=i—k „+1 i=l 


for all suchp. Then, using l|| A n Y| >Uii }l{^nj < 1{| A ?1 x" n \>u > all we have to show are conditions 
under which 


max Y] 1{|A n w n |>t} = Op{y/kn/ log 3//2 (n)) . 

i=kn,...,n—k n . J 

j=z-fc n +l 
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Obviously, 

i 

1 {|A"7V n |>l} < Ka„ ~ N (i-k n ) A n > 

j=i-kn +1 

and N n is a Poisson process with parameter / ]R l{ 7 ( :r )> u ';} A(clx) < Kun rp \ see (13.1.14) in Jacod 
and Protter (2012). It is clearly enough if the probability of more than l < oo jumps on one block 
converges to zero, i.e. 


n 

U l^+DA. - ''S-t+DA. 2 '}) i'Ka. > 1) 


j=k n 


Thus, we need the condition that for some p > 1 and some l < oo: 

k l n AjC 1 -*^)- 1 o and 2rr < 1. (81) 


Bounding the second term above comprising small jumps in case of non-truncation poses a more deli¬ 
cate mathematical problem. We restrict to the quadratic jump terms as the cross terms lead in the same 
way to an obsolete weaker criterion. For finite activity it is enough to ensure that nk n 12 y / Tog(n)rqj —> 
0. Else, define the sequence of random variables 

Zi = (|AjJ| Au n ) 2 -E[(|AjJ| A u n ) 2 }, i = 

Note from equation (54) in Ait-Sahalia and Jacod (2010) that we can bound moments of (| A,./ A u n ) 2 
in the following way: 


E 


(I A” J\ A , u n ) 2 | J 7 ( i_ 1)Ajl 


< KA n u n 


2—r 


¥ar (|A?J|A<f|.F (i _ 1)An <E (| A? J\ A u n y\ F {i _ 1)A „ 


< u n KA n u n r = KA n u n 


4—r 


for all i = 1,..., n. We decompose 


max 

i=k n ,...,n—k n 


E (|Aj J| A Un) 2 

j=i-kn +1 


i 


max y Zj 

i=k n ,...,n-k n . f—f , 

]=l-k r i+l 


i 

+ max E 

i=k„,...,n—k„ . f—f 

J=l-kn +1 

( Aj J A rt n ) 2 


where the condition 


s/knU 2 n r y/\og(n) -A 0 


(82) 


renders the second term with the expectation asymptotically negligible. Yet, the derivation of the 
maximum in the first term from its expectation can in general become much larger. Observe that 


i—k n 


max 

i=k„ ,...,n—k n 


y Zj = max \y Zj — y Zj < 2 max > Z 

^ J J i=k n ,...,n—k- I J ^ J J i=h- n I z~— J 


j=i—k n +1 


i=l 


3 = 1 


i=k n ,...,n I 


3 = 1 
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Having a sequence of independent and centered random variables, we can apply Kolmogorov’s max¬ 
imal inequality: 


max 

i=k n ,...,n—k n 


E 

j=i-k n +1 


Zi 


> A 


n . 


< ^¥ar(Z!) < A 


-v- r 

U, n 


(S3) 


Thereby we conclude that ma x-i=k n ,...,n-k n 


spi 

/_^j=l-k n +l 



0 P (un~ r/2 ) 


. We obtain the condition 


r?yiog(n) 2—r/2 

VK Ur ' 


(84) 


In conclusion, the conditions (81), (82) and (84) ensure (79). A careful computation finally proves 
that (82) is in fact obsolete, which yields our claim. 0 


Appendix C: Proof of the lower bound and consistency for the local problem 


Proof of Theorem 4.1. The proof is based on equivalences of statistical experiments in the strong Le 
Cam sense. After information-theoretic reductions, we subsequently move to statistical experiments 
that allow a simpler treatment; see (87) below. Our final experiment £4 is a special high-dimensional 
signal detection problem, from which we will deduce the lower bound by classical arguments. 

First consider alternatives with a jump as in (4). Here, throughout this proof, we set 

kn = c k (y/log(m n )n a / Ln )^ 1 , (85) 


with a constant c/, : > 0. In the preliminary step, we first grant the experimenter additional knowledge. 
We restrict to a sub-class of SjJ (a, b n , L n ) from (3), where we have one jump at time 6 E (0,1) in 
the volatility, |<jg — a^_\ > b n . Then, we assume that Qnk E {1,2,..., [n/k n \ — 1}, such that 
the jump time is in the set of observation grid points which are multiples of k n . Furthermore, we can 
stick to Xq = 0 and o s = 0,s6 [0,1]. From an information-theoretic view, obtaining this additional 
knowledge can only decrease the lower boundary on minimax distinguishability. Consequently, a 
lower bound derived for the sub-model carries over to the less informative general situation. 

To ease the exposition, we first set er 2 . = 1 and L n = 1 and generalize the result at the end of this 
proof. Next, denote with [a]b = a mod b and let 


2 

a j A„ 


1 + (k n - \j}knY n °, 9n< j < 9n + k, 

1, else. 


( 86 ) 


The discretized squared volatility exhibits a jump (resp. change-point) of order b n at 9 and then decays 
on the window [ 9 , 9 + k n A n ] smoothly with regularity a and is constant elsewhere. It suffices to 
consider the sub-class Xg C 5/(a, 6 n ,L n ) of squared discretized volatility processes of the above 
form for which it remains unknown on which window the jump occurs. 

Introduce a sequence r n with r n oc such that r n kf 1 —y 0 as n 00. We specify the following 
stepwise approximation of (CjA n )o <j<n £ A0 


a j A„ 


1 + (k n - ir n ) a n 0 

1, 


9n + (i - 1 )r n <j<9n + ir n , 1 < % < 

else. 
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Denote the observations by i}j = <T(y_i)A n (WjA„ - 'i,;,-no,. ) and )|, - o 1| _ nA i 

—i)A n ) > J = 1, ■ ■ •, n, respectively, with W the Wiener process in (1). In the sequel, it 
is convenient to distinguish the two cases where a > 1/2 and a < 1 / 2 . 

Case a > 1/2: As alluded to above, we relate different experiments: 


Si 

£2 

£3 


: Observe (r/y), ^ and information (ink 1 
: Observe (^) 1 < j < n 
: Observe x = ((of 


i<j<n --—---n G { 1 , 2 ,..., [n/k n \ - 1 } is provided. 

and information Qnk “ 1 £ { 1 , 2 ,..., [n/k n \ — 1 } is provided. 


' jU A 

orbit la ■ 


t^OieXi ’ ( a i 


eXi ’ \ u e+{i-iy n A n 


xOiex 2 )’ w ^ ere indices ( ik n ,i £ X\) expand 


over all multiples of k n , except the one where the jump is located, i.e. = { 1 ,..., dnk~ l — 
1, dnk- 1 + 1,..., [n/k n \ — 1}, and {On + (i — 1 )r n , i £ I 2 ) over all multiples of r n in the 
window of length k n A n where (<r|) is non-constant, i.e. Z 2 = { 1 , 2,..., k n r~ 1 }. (xi)teXi and 
(Xi) ,; e x 2 are i.i.d. random variables having chi-square distribution with degrees of freedom k n 
for i £ T\ and r n for i £ l;- Moreover, information Onk~ 1 £ {1, 2,..., [n/k n \ — 1} is 
provided. 

^—1/2^ ~2 , '—2 \ ( — l/2Z-~2 , ~2 

'^iknAn + a ik n A n ) V n k,i a 0+(i- l)r„A„ + ^P+F-lpnA,, 


£4 : We observe £ = ((fc n 1 2 | 


i) i£X 2 ) 


where (£j,£j) are i.i.d. standard normal random variables. Moreover, information Onk n l £ 
{ 1 , 2 ,..., [n/fc n J — 1 } is provided. 


When considering the above experiments, we always have (<x|a„) G A# (or (<Xy An ) G Ay) as un¬ 
known parameter that index a family of probability measures {P/ 2 ,}. For the sake of readability, 

' j^n' 

we move this formalism to the background and omit subscripts indicating the parameter space. We 
show the following relations for the experiments, where ~ marks strong Le Cam equivalence and ~ 
asymptotic equivalence: 


£\ « £2 ~ £3 ~ £4- 


(87) 


Finally, we shall derive the lower bound in £4 which carries over to £\ by the above relations and thus 
also to our general model. The proof is now divided into four main steps. 

Step 1 £\ ~ £ 2 ' For random variables U. V and their laws P u, Py, we denote the Kullback-Leibler 
divergence D([71| V) = D(P[/||Py) = / log (dFu/dPv)dPu- For normal families with unknown 
variance Pg = N{ 0, 0), it is known that 


D(P fl ||P fl /) 


Eg 





such that for 6 = O' + <5 and considering asymptotics where 5 —y 0, we obtain 

D(P»' + a||P»-) = -i(log(l + /)-/) = +°(A • < 88 > 

As £\ and £2 share a common space on which the considered random variables are accommodated, 
asymptotic equivalence holds if ||P( r?i ) — P(?j,) 11 TV' -A 0 as n 00 where || • \\tv denotes the total 
variation distance and P(, ?J ) the law of observations (rjj). We exploit Pinsker’s inequality 

\\ p (vj) ~ p (%)llxt/ ^ l D ((%)!!(%)) • ( 89 ) 


By Gaussianity and independence of Brownian increments, implying additivity of the Kullback- 
Leibler divergences, it follows with ( 88 ) for a piecewise constant approximation of a function with 
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regularity a on k n r n 1 intervals of length r n A n : 


kn'T'n f'n 


D(fe)llfe))=0(l) X J20 A n) a = 0(n- 2a k n rl a ) 
i =1 7=1 

which tends to zero for r n k~ L = 0(n ~ e ) for some e > 0 . 

Step 2 £2 ~ £:>,'■ The vector of averages 

bn 

:‘E« 


1 E 'S, 

7=1 


v n 


7=1 


n+(i-l)r n +j-l) . £lr> 


forms a sufficient statistic for (<rj Thereby we conclude, see e.g. Lemma 3.2 of Brown and 

Low (1996), the strong Le Cam equivalence. 

Step 3 £3 ~ £ 4 : Let *<> = {k~ 1/2 (af knAn ( X i ~ M) ieXl , A/ 1 / 2 (^+(i-i)r„A„fe - ^)) ieX2 ) 
and^ = (fcX,Aj lG xp (?i^ +(i _ 1 )? , nA J ieX2 ). In both experiments random variables are accom¬ 
modated on the same space. Rescaling and a location shift yield with Pinsker’s inequality 

ll p x - p ellL = ll«v - «y Wlv £ 5 D (xlir). 

By independence, it follows that 

D(xir) < D fc 1/2 4fe - k n )Ui- 

ieh 

+ X D ( r n 1/2 ° 2 c . (i—l)r„ (Xi ~ r n) Mi° 2 c (i-Dm ) • 


i n, 


*ex 2 


An application of Theorem 1.1 in Bobkov et al. (2013) yields 

XI B { k n 1/2 °ik n A n (Xi - k n )\\&a? knAn ) = O(nk~ 2 ) , 


ieh 


X D ( r n 1/2 ^+(i- 1 )r„A„(Xi - rn)||li^ +(i _i )r „ A J = 0(k n r n 2 ) . 

tex 2 

For a > 1/2, we have nfc“ 2 = o( 1). Choosing r n sufficiently large such that k n r~ 2 = o( 1), it 
follows that 


■X 


£ Wtv 


= o(l), 


(90) 


what ensures the claimed asymptotic equivalence. 

Step 4: By the previous steps, it suffices to establish a lower bound for the distinguishability in ex¬ 
periment < 54 . Adding an additional drift, which gives clearly an equivalent experiment, we consider ob- 

“ 1 )iSXi ’ X &^0+(i-l)r„A n + °0+(i-l)r n A n ~ 


servations £ = ( (k n 1 / 2 ^af k 


'ik n A n + a ik n A n l ) ieXl ’( rn ' & a e+(i-l)r n A n + a e+(i-l)r n A n 

Then, the testing problem can be interpreted as a high dimensional location signal detection problem 
in the sup-norm. More precisely, we test the hypothesis 


Hq : sup (< 7 / — 1) = 0 against the alternative iTi : sup (< 7 ^ — 1 ) > b n , 

7 7 


( 91 ) 
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and we are interested in the maximal value b n —> 0 such that the hypothesis Ho and H\ are non- 
distinguishable in the minimax sense. Non-distinguishability in the minimax sense is formulated as 

lim inf 7 , /.(a. b n ) = 1, (92) 

n—> oo ^ 

_ a 

and the detection boundary here is b n oc (k n A n ) n oc n 2a + 1 . In order to show (92), we proceed in 
the fashion of Section 3.3.7 of Ingster and Suslina (2003). Let P^ be the law of the observations. We 
consider the probability measures 


Po = Pg X P<? 0 and Pi = P| X P^ , 

where ¥'o (} means the hypothesis of the test applies (no jump) and P@ 1 draws a jump-time 6 with 
6 nk~ l E {1,..., [n/k n \ — 1} uniformly from this set. Therefore, Po represents the probability mea¬ 
sure without signal, and Pi the measure where a signal is present. It then follows that 

1 1 _ 

inf 7 ^(a,6 n ) > 1 - -||Pi -P 0 || TV , > 1 - ^|Ep 0 [ L o,t - l]| 1/2 , 

with Lo,i = cflPi /(/Po the likelihood ratio of the measures Pi and Po- For the validity of (92), it thus 
suffices to establish 


Ep 0 [^6,i] l as n —> oo. 


(93) 


To this end, for given 6 we denote with u® nkri = a f° r * e ^ 2 , v= ( u ]^ ~ 

1 /2 

1 )r n . We first perform some preliminary computations. Denote with (py(x) the density function of 
a Gaussian random variable Y, not necessarily standard normal, and for a, be [n/k n \ - 1} 


I a ,b(x,y ) := PI 

i€I 2 


i r V£i(uM*+v$(yi) 




Xi 


n 

iel 2 




Then, we have that I a<b := f I a ,b( x ,y) TLgz 2 ^(x^dx,JIiez 2 P^iy^dyi 
[n/k n J — 1}, consider 





2 


1. Next, for a E 


Observe that for a standard Gaussian random variable Z and s,t E HI, |s| < 1/2: 


E[exp(sZ 2 + tZ)] 


(1 - 2„r 1/2 exp (j/Fj) ■ 


(94) 


This, together with the inequality 


f T n 1 

Cokn (k n A n ) 2a <r n Y, - ^n)A n ) 2a < k n (k n A n ) 2 “ 

t=0 

for some constant Co > 0 and routine calculations yield for some Co < C) < 1: 

II a := f II a (x) (p^(xi)dxi < e C| L l (fc n An) 2o (i + o(l)). 

J i£Z 2 
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With all the preliminary calculations completed, we are now ready to derive a bound for 

[n/k n \-l 

E Po [LgJ - 1 = £ P (6nk~ l = a)P(0nfe“ 1 = b) (J a , 6 - l) 

a, b=l 
a^b 

[n/fenj—1 

+ £ =a) 2 (// a -l) 

a=l 

where the first sum vanishes. For an appropriate choice of > 0 in (85), we have that k n (k n A n ) 2a = 
C/ log (n/k n ) for some C 2 < Cj -1 . Since P(0n/c“ 1 = a) = k n A n , we thus obtain 

[n/kn J 

lEpjL^] - l| < J2 = a) 2 (e ClkniknAn)2a - l) 

a= 1 

= (1 + o(l))k n A n e c ^{k n ^ n ) 2a _ ( 95 ) 


We conclude (93) using 

k n A n e Clkn( ' knAn ' >2a = k n A n exp (C/ C 2 log (n/k n )) = (fc n A„,) 1_Cl C2 = o(l). 

Case a < 1/2: The only time we make use of the condition a > 1/2 above is in Step 3 to obtain 
n/kn = o(l). The necessity of this relation is due to the large number of blocks n/k n , when op¬ 
erating with the entropy bounds. To establish the lower bound, this constraint can be removed by 
granting the experimenter even more additional information what is briefly sketched in the follow¬ 
ing. Indeed, suppose we know in addition that On € { k n . 2 k n , ..., l n k n } where l n = n [ -C n//,:.„, 
I > 0 arbitrarily small but strictly positive and such that l n € IN. Using the sufficiency argu¬ 
ment of Step 2, we can gather all the information contained in (r/i);„A: n <i<n in one single average 
(n — (l n + 1) k n )~ 1 Y^i=i n k n +1 T lf ■ Then, one can repeat Steps 3 and 4, subject to the weaker condi¬ 
tion l n /k n = o(l). Selecting 1 > 0 sufficiently small for each 0 < a < 1, this is always possible. 
Substituting nk~ l by l n in the sum and (squared) probability in Step 4, we obtain instead of (95) 

|Ep () [^0,1] - l| = (1 + o{l))l~ l e ^iMfcnA„) 2a _ 

For an appropriate choice of cp > 0 in (85), k n (k n A n ) 2a = C 2 log (n/k n ) with C-> < (Cf . Hence, 
we conclude that the term tends to zero and the lower bound in Step 4 gives the same minimax detec¬ 
tion boundary. 

Let us now touch on the general case with some cl > 0 and sequences L n . We can divide for¬ 
mulas in (91) by a 2 to rescale. Exactly the same arguments lead to lim n _ >oc inf/, 7 ,/, (a, b n ) = 1 for 
k n given in (85) with b n < L n (k n A n ) a a 2 _, which gives the general result. Finally, we remark on the 
regularity alternative Hf. The proof is along the same lines as for jumps where instead of a jump 
of size L n (k n A n )° at unknown location, we observe a sudden, more regular increase in af of size 
L n (k n A n ) a+a , where we exploit the regularity a' in sets Hence, the jump gets replaced with a 
gradual regular increase. Then, the arguments are almost identical. This also highlights the fact that at 
(or below) the boundary b n , the different alternatives H\ and Hf are not distinguishable. □ 
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Proof of Theorem 4.2. Using similar arguments as in the proof of Theorem 3.2 and in particular 
Proposition A.4, one derives for 


V n .i = 


If Ej-i-ts+i m x ? - E [( A U> 2 ]) - f SfSi -gAW 2 ])! 

” EtSi(A ?^) 2 


k n j' 

kn<i< n — that under the alternatives H\ and Hf, it holds that 

VKVn,i = 0 P ( 1 ), K<i<n-K. 

Based on (96), a simple estimate yields 

(l-Op(l)) 


n 


v* > —V n ^ nd j + — 

^n 

>-Or({K)~ 1/2 ) + ^ 

" / -n 


r 6 

rO+K A n 

/ (rjds- 

/ °8 ds 

Jo-K A n 

Je 


o', 

2 f e + k 2 

(j~ds— / o; ds 

Je 


e 


(1 -o P (l)) 


sup 0 <t<i <?t 


2 ' 


Observe that in order to prove 7^0 (a, b'',) -» 0, it suffices to show that 

P(U n * > 2C'V 21 °g("£)/*&) ->• 1 under Hi or #?, 
and P(V„* < 2 C?\J 2 log(m^)/—> 1 under ido- 

Case i?i: Using that (of — Acrf ) tg [ 0 i i] £ S(a, L n ) and Aoy > 0, we get 
n r e [9+K An 

— sup / a 2 ds — / of ds > - 2 L n (k° A n ) a . 

t>o Je-k%A n Je 

Hence (98) follows for a' = 0 in (34) with (97). 

Case idf: For erf £ (a. a', 6 *, L„, A:„) , we have that 

ol+h, > of + b n h a ' or tj 2 e+h < a} - b n h a ', for 0 < h < 2k n A n . 

It follows that 

r$+ 2 k°A n 

le+k° A n 

Therefore (98) follows for (34) with (97). 

Case Ho: Under the hypothesis we employ the upper bound 

U n *< max V n}i + L n {k° n A n )° 

K<i<n-K 

to prove (99). (33) implies 

2 C°y /2 log (rn° n )/K > 2 y/ 2 \og(m° n )/k^ + L n (k% A n )°, 


n 

uo 

‘ Vr n 


(of - of_ fc o A J ds 


>b n (K A n ) a 


(96) 


(97) 


(98) 

(99) 


and hence it suffices to show that P(maxj.«<j< n _^ V n ,i < yj 2 log(m£)/A£) —>• 1. This, however, 
follows from a direct adaption of Theorem 3.2, which completes the proof. □ 
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Appendix D: Proofs of Section 4.2 

Proof of Proposition 4.5. We use the following elementary lemma to prove Proposition 4 . 5 . 

Lemma D.l. Let f[t ) and g(t ) be functions on [0, 0] such that f(t ) is increasing. As long as f{9) — 
f(9 — 7) > sup o <£<0 \g(t)\ for some 7 E [0,0], we /rave that 

argmax o < t <0 (/(f) + 5(f)) > 0 - 7. 

A« analogous result holds if fit) and g[t) are functions on [0,1] and f(t) is decreasing. 

For 0 E (0,1) define i* = \ 9n\ , the smallest integer such that i* A n is larger or equal than 0. While 
(of )/ e [o.i] is the squared volatility process containing one jump at time 0, denote by (of )te[o,i] the 
same path without jump, such that 


a iA n = °iA n +6t(i>i*) 

with jump size <5. Without loss of generality, we assume 5 > 0. Define 

0 , if i + k n < i*, 

f(iA n ) = \ (i + k n - i*)kf 1 ^ 2 5 for i = i* — k n ,..., i* , 
\fkfb if i > i*. 


( 100 ) 


and (/(f))te[o,i] the associated piecewise constant increasing step function. For i = k n ...., n — k n : 


i+kn 


X - E n{A^Xf 

j=i-k n +1 j=i +1 


i+k ,, 


= { E - nn(A]X) 2 }) - E {n(A]X) 2 - E[n(A]X) 2 })} 

j=i-k n +1 j=i +1 

i i+k n 

+ { £ ( E M7"W] - SjtJ - E 7M7” W - 4aJ) } 

j=i—kn +1 i=*+l 

j l+fcn i+fcn j+fcn 

+ { X] ff iA„ - E ~ X a jA n ) ='■ A i + B i + C i ~ X ( a JA n ~ 

j=i-fe„+l j=i +1 i=t+l i=i+l 


with the obvious definition using the curly brackets. Thus, for the step function (ff(f))te[o.i]> with 

( i i+kn i+kn \ 

E n(Aj'X) 2 — E n(X>jX) 2 + E ( 4 a„ - 4 aJ 
J=i-fc n +l i=i+l J=i+1 / 

for i = k n ,... ,n — k n and g(iA n ) = 0 else, we have that 


s/kn 9 (f An) — i + B n i T C n ,i T- D n { , 
i+k n 

-° n ,i= X ( a JAn - °jA 

j=i +1 


) - (4 a „ - 4aJ 
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i+k„ 


^2 “ a i* An) + (4a„ - VjA n ) ■ 

j=i+1 


Exploiting the smoothness of (cr*) te [o,i] by Assumption 3.1, we obtain 


i+k n 

max I Am I = max I D ni \<K sup |<r s | max \ a jA n ~ A*aJ 

i=l i*—kn<i<i* 0<s<l i*—kn<i<i* LJ 

- j=i+l 

< K sup |cr s |fc^ + 1 I n _n = £> a .s.(\An lo g( n )) , 

0<s<l 

with some constant K by (16). Proceeding similarly as in the proof of Theorem 3.2, it follows that 

max I A n>i + B Ut i + C n A = £>p(\An log(n)) . (101) 

i=l,...,k n 

Altogether, we conclude that 

sup |p(f)| = £>p(^log(n)) • (102) 

te[ o, 0 ] 

Finally, using (100), we see that f(iA n ) > \g(iA n )\ > 0 holds for each i = i* — k n / 2, 
with probability tending to 1. In particular, 

v n,i = |/(*A n ) + g(*A„) | = / (*A n ) + sign(p(zA n )) |g(*A n ) | (103) 

for those i. Furthermore, 

f(i*A n ) - f(i*A n - t n ) = ['y n n\6k~ 1/2 for 7 n E [0,k n /(2n)]. 

Thus, we choose 7 n such that 

y/k n \og{n 


5n 


= o(7n). 


(104) 


Now the assumptions of Lemma D.l are fulfilled, and we obtain, with probability tending to one, 

i* An > argmax i=fcni ^ At* A n > **A n - 7„, 
through an application of Lemma D.l together with (103). A similar argument for i > i* shows 
i* An < argmax i=i » )ra _ fcn A n < i*A n + 7 n , 
from which one obtains 1 0 n — i* A n \ = Op+ n ), which completes the proof by definition off*. □ 


Proof of Proposition 4.7. Suppose that 

y/kn5 n = o(v / log(n)) (105) 

and we have a consistent estimator 9* for 0. Define 

( 6 *n— 1 6 *n+k n \ 

AN 2 - £ AN 2 )- 006 ) 

j=6*n—k n j=6*n-\-l 

Using the statistic T~,, we can now test for jumps in the volatility a 2 . Note that due to (105), it readily 
follows that this new test has a detection boundary b n = o(+log(n)/\/J+). This, however, is a 
contradiction to Theorem 4.1, and hence such an estimator 9* cannot exist. □ 

Finally, to prove Proposition 4.9 we may proceed exactly as in the proof of Proposition 4.5. 
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Appendix E: Proofs of Section 5 
Proof of Theorem 5.4. 

Lemma E.l. Decompose Xt from (1) in At = / ( j a s ds and Mt = Xq + rr s dW s . Define for 
i = 2,..., n: 

Rn,i = n 2 (((A"M) 2 - (A tiM) 2 ) 2 - |((A”M) 4 + (A^M) 4 )j , (107) 


ul = 


■ max 

V 77- — 1 m=2,...,n 


m 

E 

i =2 


Rn,i 


t:u Rn 

n — 1 


(108) 


r/ie« |i4 - ul\ A o. 

Proof. The inequality (68) implies that 


\v^-ul\< 


y/n — 1 


max 

m= 2,...,? 


i=2 


n 

Rn,i )| + IE (Qn,i Rn,i) ^ 


i=2 


and it suffices to discuss the first addend. To this end, we start with the decomposition 

n 2 ((A"A ') 4 - (A”M) 4 ) = n 2 (4(A”M) 3 A™A + 6 (A? M) 2 {A]A ) 2 + 4A"M(A”A ) 3 + (A”A) 4 ), 

and from |A"A| < Kn~ l and E[| A/M\ p ] < Kn ~ p / 2 by boundedness of all coefficients, it is easy to 
see that the only term to discuss is the leading one, i.e. n 2 (A/M f'A 1 - A. Under Assumption 5.1, one 
may approximate both processes inside A "A and A ”M by their value at (j — 1) A re , and the error due 
to this approximation has exactly the correct small order. For the remaining approximated versions 
of n 2 (A”M) 3 A”A, standard martingale techniques prove the required bound. The term involving 
n 2 ((A"_ 1 A ) 4 — (A7_ 1 M) 4 ) is discussed analogously. Finally, 

n 2 ((A”_ l XXjX ) 2 - (AJ.jMAjM) 2 ) = n 2 ^A”A((A J "_ 1 AF) 2 A”A 

+ 2(A]_ 1 M) 2 A]M + 2A]_ l MA]_ 1 AA]M + 2A]_ 1 MA™_ 1 AA™A) 

+ A"_ 1 A((A^M) 2 A”_ 1 A + 2(A]M) 2 A™_ 1 M + 2A 1 jMA 1 ]j_ l AA 1 jA) 

+ XjAXj_ 1 A(A , jAA , j_ 1 A + 2A”MA"_ 1 M)). 

Again, the only relevant terms are those involving one factor of increments of A and three factors of 
increments of M. The same reasoning as above gives the required result. □ 

Proposition E.2. On the hypothesis of Testing problem 5.2 and Assumption 5.1, we have the func¬ 
tional convergence 


1 

y/n - 1 


|ntj 

^(i^-E[(A?p) 2 ]) 


i =2 




v s dB s , 


(109) 


weakly in the Skorokhod space with v 2 = (80/3)<rf and ( B s ) a Brownian motion independent of IF. 
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Proof. Denote the set of time instants [iA n , (i + l)A n ] subject to discontinuities in (ot)t >o with J n . 
Since | J n \ < oo for n —> oo, we have that 


1 V—_ P 

== ^2 Rn,j > 0 


s/n - 1 


jtJn 


as n —> oo on Assumption 5.1 (ii). Therefore, we may assume throughout this proof that (39) is 
satisfied, for simpler notation on [0,1]. Standard localization arguments allow us to assume that (41) 
and (39) apply on [0,1] with global constant K. Ito’s formula gives 


Ci = (A”M ) 2 - 


o‘:ds = 2 I (M s — dM s , 


r*A„ 

'(i-l)A n J(i- 1)A„ 

which forms a martingale difference sequence in i = 1,,n. For its quadratic variation, we obtain 

riA n 

[C, Cji = 4 / (Ms - A%_daJ 2 d[M, M\ s 
!(i- 1)A„ 

[•iA„ 

= 4 

'(i-l)A n v d(i- 1)A„ 


[■iAn , rs . 2 

/ ( / cr t dW t ) Os ds . 

J (i—1) A n ' J (i—1) A n ' 


Applying Ito’s formula to (A”M ) 4 yields 


-(A”M ) 4 = - 


riA n 


( [ otdWtVosdWs+A / " ( j o t dWt ] 2 o 2 s ds 

n y J(i- 1)A„ ' J (i-l)A n ' v J (i— l)A n ' 


(i-l)A„' J (i—1) A 
= Ni + [C, C]i , 

with a martingale difference sequence (Aj)i<j< n . We decompose 

riA 


Rn.i — Tl 


/ x 2 9 

[Ci - + / (a 2 - a 2 _ A J ds) - -((A?M ) 4 + (A^_ 1 M) 4 ) 

J (z—1) A n 

— h + Hi + Uli , 

with I* = n 2 (C 2 - [C, Cji + Cti - [C, C]i- 1 - 207^-1 - N - N t - i) , 

iAn 

' ~s ~ a s- A n ) us > 

•i(i-l)A„ 

^A n \ 2 


( 110 ) 


Ili = 2n 2 (Ci - Q_i) / (cr 2 - ct 2 _a ) ds • 


IIP = rd 


(ct 2 - ct 2 _ a J ds) 
(i—1)A„ 7 


The terms (IIi)\<i< n and (///,) i<,<„ are asymptotically negligible. We start proving that 

|nij 


sup 
*G[0.1] ' “ 


^2 If = Cp(\/n) • 


(Ill) 


It suffices to prove that 


|ntj 


n 


~ 1//2 E sup V2n 2 (C) - Ci-i) 
te[o,i] j_2 


(‘iAn 


(i 1) An 


— ^s-aJ ds 


0 . 


( 112 ) 
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As by Assumption 5.1 we have that 


J (i-l)A„ 

( 112 ) is implied by 


(■ IA n piAn pl^n 

(Vs-Vs-A n )ds= (y s -v s -A n )ds+ \ {q s - Qs-A n ) ds , 

J (i— l)An 


' (i—l)An 


n 


n 


-V 2 E 


_1 / 2 E 


iMl 

sup IE 2 n 2 (Ci-Ci 




i-i) / (vs-v s -a n )ds 

J(i- 1)A„ 

AA,. 


*e[o,t] 1 i=2 

[nt\ 

sup Y] 2n 2 (Ci - Ci- 1 ) / (£>s - Qs—An) ds 

*e[o,i] 1 j_ 2 J (i— 1 )A„ 


0 , 


0 . 


(113a) 

(113b) 


By Cauchy-Schwarz and Jensen’s inequality we derive that 

, L nt J AA„ 

sup | - Ci 


2n 2 E 


ie[0,l] j_2 


i-u / (v s - is s -A n ) ds 

J (i- 1)A„ 


n EL EL / \ 2 1/2 

<2A VfCi-Q-ifV / Jds 


«=2 
n 


1/2-1 


2n 2 ( ^(E[(Ci-Ci-i) 4 ]) 1/2 A n E 

*d=2 


l/2\ 1/2 


ft ft /».} /\ 

[ | /* AAA7i 

V(Q - C,_i ) 2 V A n / - i / s _ A „) 2 ds 

K-^-aJ dsj J J 

2n 2 (^(E[(Q-Q_ 1 ) 4 ]) 1/2 A3 /2 E[ (^ - ^_ A J 4 dsj V2 ) V2 

V , , ^ df7 — 1)An J 7 


if =2 

= 0(n 2 Af/ 2+e ) =o(n 4 / 2 ), 


jA n 

j A n 


since 


sup| s _ t |< An (E[(i/ S - ^) 4 ]) 1/4 < A,V 2+t for some e > 0 and E [{Ci - C t - 1 ) 4 ] = C>(A 4 ). 
This proves (1 13a). In order to verify (113b), we exploit independence of (f? s ) s e[o,i] an d (Ws)se[o,i] 
by Assumption 5.1. In particular, we have 


E 


piA n 

Ci / {q s ~ Q s -A n )ds a(g s , s E [0,1]) U -E(i-i) A „ 

J (i-l)A„ 

r^A n p 

/ {Qs - Qs-A n ) ds E Ci <j{q s , s E [0,1]) U Tpi_ i )Aii 

■Hi- i)A„ L 


= 0 , 


(114) 


and analogously for the term with Q-i, such that the martingale property follows by iterated expec¬ 
tations. Thus, by Burkholder and Cauchy-Schwarz inequality, with some constant K: 


E 


sup ( 2 n 2 (Ci - Ci- 1 ) / (q s - Qs-A n ) ds) 

t(=[0,l] i=2 ^ (i— l)A n 

77. 

KKn^W.UCi-Ci-^fi " (q s — Qs-A n ) ds) 
A—n ^ 'J(i-l) An ' 


2 i 
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< K ^[(Q - a_!) 4 ]E[(A^) 4 ]) 1/2 = 0(n l - 2a ) , 

i=2 

where we used (42) in Hypothesis 5.2 in the last step. This implies (113b) and thus (112). Next, we 
show that 


Decomposing er 2 


i L ” tJ / \ 

sup [ nii ~ E [( A ^) 2 ] ) = °p(Vn) • 

te [o,i] j_2 

u s + Og again, the main point is to guarantee 

[nt\ 

sup ^((A?p) 2 -E[(A”p) 2 ]) = o P (v / n), 

[o,i] j _2 


(115) 


(116) 


as the other terms may be treated as for (//,),<,<„ above. (1 16) is ensured by Assumption 5.1 ( iv ). 
Denote by ( 7 , = Cf — [67, 67] { —Ay, which forms a martingale difference sequence (Ui)\ <,< n . Observe 
that 

m m m —1 

^ = n 2 ^((7, + Uj—\ - 267,67,_i) = n 2 ^ (2(7, - 267,67,.!) + U m - Uy , 

i= 2 i=2 i=2 

which is a martingale sequence plus an asymptotically negligible remainder. We establish a functional 
stable central limit theorem for 


\nt\ 2 \jit\ 

^Z n ,, = ^=^(2(7,- 267,67,_i) (117) 

z=2 * i =2 

based on Theorem 3-1 by Jacod (1997). Denote with Q^ n = 7 r ,A n = p(y s , W s , q s ; s < zA n ). Since 
we consider a martingale directly, the drift condition in Theorem 3-1 by Jacod (1997) is trivial. Thus, 
conditions 


V nt \ pt 

^E[Z 2 ,|£,_ 1>n ] A / v 2 s ds (118a) 

i=2 J ° 

with a predictable process (u s ) s >o, for all e > 0: 

fntj 

^E[Z 2 ) ,l {Zn . >£} |g,_ lin ] AO, (118b) 

i =2 

[nij 

AO, (118c) 

i=2 

for all bounded (7,-martingales {Mt)t >o with A4o = 0 and [W, M] = 0 or for M. t = Wt, imply the 
functional ((7,)-stable limit theorem 


Vnt] < *\ 

yz,/4" f v,dB, 

i =2 


( 119 ) 
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with (Bt)t >o a Brownian motion defined on an orthogonal extension of (Q. Q. (Gi). P). Ito’s formula 
can be used to determine the relation 

Var((A?M) 4 |£_i,n) = 96 A 4 {a 8 (i _ 1)An + O a . s .{n - (aAl/2) )) , 

and similar ones for other powers, which also readily obtained by the mixed normality A”A/ ~ 
(Tg ds ) and approximation of a to obtain measurability with respect to G %-1 .« • Using 
the form of Z n i from (117), we derive for the left hand side of (118a) 

\nt\ \nt\ 

= n 3 J2 (^[Ui\Gi-i,n] +4E\C?Cl l \G i - 1 , n ] -8E[Z7 i C , i C i _i|ai-i,n]) • 

i =2 i =2 

For a simpler notation we consider the three terms consecutively. First, 


n 


\nt\ 

i=2 


-1 ,n\ = n 


\nt\ 

'’E 4 (( A ?-i M > 

i=2 
|ntj 


/■(*- 1)A„ . 2 

2 - / <7 2 d s ) E[C' 2 |^_ 1> , 

J(i-2)A n 7 


8 ^E K*-2)a„ + a.,(n- ( “ A1/2) )) ((A^M) 2 - / a 2 ds) 

j—2 7 —2)A n 


16 / (J ° ds. 


The last convergence is very close to the usual analysis of the variance of realized volatility, see 
Section 5 of Barndorff-Nielsen and Shephard (2002). We frequently use Ito’s formula, especially for 
the second term: 


\nt.\ 


[ntj 


n 


i=2 


i=2 


f‘iA n 


= n 3 ^4E ((A”AT) 2 — / a 2 s ds) +-(A?M) 8 


(i-l)A„ 


4 4 


- 2E 


, /"iA„ 

((A/M) 2 - / 

' «/ Tz— 1 


(i-l)A„ 


2 2 


aids) ^(A/AT ) 4 


Si- 


i—l,n 


iMl 


= n 


'* E 4 Ki-DA„ + 0« «. (n- ( “ A1/2) )) (60 + 105 ■ - - 156 • -) 


i=2 


32 r „ , 


Finally, EfCACiCi-i 1= 0, since 
E[C7 i C' i |ft-i,n] =e[((A"AT) 2 - 


*A n \ 3 2 y /* i A r, 


'(*-l)A„ 

vanishes. Thereby we derive (118a) with 


-^(A?M) 4 (A?AT) 2 - 


(j 2 ds 


' (*—l)A n 


Si- 


r; 2 = (80/3) . 

The Lindeberg criterion (1 18b) is ensured by the stronger Lyapunov condition 

\_nt\ [ntj 

^E^lft-!,*] =n 6 ^E[(2U i -2C i C i _ 1 ) 4 |^_i, n ] = 0 P (A n ) 


i—l.n 


( 120 ) 


i=2 


i=2 


0. 
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Condition (118c) for Ad = W follows with Ito’s formula by the fact that E[(A ™W) P ] = 0 and 
E[(A iM)P] = o P (l) for all odd p > 1, using the usual approximation. Ito’s formula also yields for 
any p and bounded martingales (M)t>o that (J^Al a cr s dW s ) p J'(* A ]) A dM s equals a negligible 

remainder plus the term a p s d\W, M]s which vanishes, such that (118c) is satisfied. This 

completes the proof of (1 19). □ 

Theorem 5.4 follows with Lemma E. 1 and Proposition E.2 by continuous mapping theorem. 

Proof of Proposition 5.6. The condition a > 1/4 guarantees that 


n 


- I/2 E 

i =1 


/ 2 2 \2 JP n 

l CT (i-l)A„ “ a (i- 2)Aj > U : 


and analogously with fourth moments, since n 1 / 2 2a — > 0. Hence, a consistent estimator of rj/ 2 ) A 
can be used to standardize ( Q n ,i)K n +i<i<n and {R n ,i) K n +i<i<n- Thereto, we write 
R n ,i = (&fi- 9 )a 1 Rn,i with the estimator and K n from (49) and 


Ul = 


max 


y/n — 1 m=K n +1 ,... 


E 

i — K.n ~\-1 


Rn.i 




K n +1 


Rr, 


n — 1 


The proof is now traced back to the one of Theorem 5.4. Set 

A = |2)A n - a (i~ 2)A n / 2 } • 

Since (cr 2 ) s > 0 is bounded from below and for K n —>• oo the estimator is consistent, based on Markov’s 
inequality we obtain analogously as in the proof of Equation (22) of Vetter (2012), that P(^) = 
e>(n -1 ), such that 


p( n a)>i- x] p(a c )= i-o(i). (121) 

i=K n -\-1 i=K n -\-1 

Then, we use decomposition (110) for R n ,j and an analogous one for R n ,i- By the fact that is 

Qi~ 2 ,n = R(i - 2 ) A „ _ nicas u rah I c, the leading term of the latter has a martingale structure again. Similar 

~ + — + P 

estimates as in the proof of Proposition E.2 yield that | Un — f/), —> 0 for 


ul 


1 

: max 

yn - 1 m=K n +l,...,n 


i=K n +1 (z—2)A n 


Yhi=K n +1 RnA(A) 

(n - 1 )<4_ 2)An 


Observe that the analogue of the leading terms ( I,) in the decomposition (110) for decomposing ll}, 
remain a martingale difference sequence (plus a negligible remainder), since o‘^_ 2 )A ' s Si- 2 ,n = 
J 7 (j_ 2 )A n -measurable. Analogous terms to //, and III, in (111) are handled as above. This readily 
proves Proposition 5.6. 


Proof of Theorem 5.5. First, let us state two preliminary Lemmas E.3 and E.4. 
Lemma E.3. On Assumption 5.1 it holds that E[|l/1 — f/JljP 7 ] = Op(l)- 
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Proof of Lemma E.3. Introduce the interpolated volatility 


cr. = 


0, if t £ [0, K n A n ), 


df KnAn , if t G ( iK n A n , (z + l)/v„A n ] and 1 < i < ra/AT n . 

Observe that with a standard Brownian motion (B s ) s >o independent of IF, we can write 


( 122 ) 


Snt = / vjdB s , t £ [0,1], 


(123) 


setting in (50) V K n A n Zi = B( i+1} KnAn - B t KnAn . Let 6 n -t OandputAi = (supo^, \of-of\ < 
5f}. Then it follows similarly as for (121) that P(^) —> 0. Since A n £ T, we thus obtain 


p(E[|W-v;t||^] > 8 n ) <p(E[|y f -v;t|i(^ n )|^] ><s„)+o(i). 


(124) 


Moreover, by (123), the triangle, Markov, Burkholder and Jensen inequality, we obtain with generic 
constant K: 


P(E[|W - V^\l(A n )\B] > 5 n ) < P(E 


sup 

0<Z<1 


(erf - Og)dB s 1 (A n )\F 


> 5, 


1/2) 


< K5~ l ¥, 


sup 

L 0<t<l 


/( 


°s - I < S l)dB s 


< ICS 


-1 


(/ _CT s) 21 (ks-4l - 


(125) 

□ 


Combining (124) and (125), the claim follows. 

Lemma E.4. Let (B s ) be a standard Brownian motion independent of T. Grant Assumption 5.1. The 
distribution ofV f conditional on T is uniformly continuous on compact sets K with 0 0 1C, i.e; for 
any e > 0 there exists a 5 > 0( depending on K) such that 


sup sup |P(W < x\JF) — P(V"f < x + y\JF) I < e. 

x&K\y\<8 ' 


(126) 


Proof of Lemma E.4. Denote with 


w t = [ o^dB s -t [ crfdB s , i £ [0,1], 

Jo Jo 

and for r > 0 we partition [0,1] = [0, r] U (r, 1 — r] U (1 — r, 1], and (r, 1 — r) = U^Tq (tj, A+i], 
where to = r, td = 1 — r, and for some c > 0: \L + i — ti\ < cd~ l . Since info<t<i of > C > 0, we 
have 


c. = mm 

r<tn<l—r 


E [ w l I- 77 ] = r< ™“_ r (( 1 “ ^) 2 j o*ds + t? £ o 8 s ds 

> min (1 — ti)tiC A = (1 — ?’)rC 4 > 0. 
r<ti<l~ r 

Similarly, since sup 0 <t<i of < I\, we have niax r <( i: <i _ r JL[wf\F] < < 00 . Next, the uniform 

bound and Burkholder’s inequality yield with generic constant K on Assumption 5.1 


sup min I wt — Wt I > x\J 7 ) < P( max sup I wt — Wt I > xlJ 7 ) 

r <i<l_ r l<i<d 1 ' v l<i<d-l *,<*<*..,, 1 ' 




( 127 ) 
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d ~ 1 IV /A+i \ p / 2 
<Kx~ p ^2e 


i —1 


tig ds 


F 


< 


AV P E[ sup |<r t 4 | p |A] " ^) P/2 ^ tf* _p d~ p/2+1 - 


0<t<l 


i=l 


In a similar manner, we also derive that 

sup \wt\, sup |uy| > x\F) < K x~ p r p ^ 2 . 


(128) 


0 <t<r l-r<t<l 

Using (127) and (128), we obtain the lower bound 

P(l/1 < x\F) > P(max Fwi < x — y\F) — y~ p d~ p ^ 2+1 — x~ p r p ^ 2 , (129) 


and the upper bound 


P(U f < x\F) < P (max ±w ti < x\F^j. 


(130) 


Combining both (129) and (130) in turn yields 

|P(U f < x + 5\F) - P(W < x\F) | 

< |P( max ± < x + (ilj 7 ) — P( max ±Wi < x — <5|A')| + d~ p d~ p ^ 2+1 + x~ p r p ( 2 . (131) 

l<i<d l<i<d 

Since 0 < < oo for any r > 0, an application of Lemma 2.1 in Chernozhukov et al. (2013) yields 

that 


P( max ±< x + 5IJ 7 ) — P( max ±wt t < x\F) I < C(ff)5 \/1 V log(d/5), (132) 

l<i<d l<z<d 

with C{<^f) < oo depending on c,^. Since K is compact and 0 0 JC, we have sup,,^ x~ p < C P (IC) < 
oo. For p > 2, select d = 5~ q with positive q satisfying 2(p + q) < pq. Then combining (131) with 
(132) yields 

|P(U f < x + <5|A) - P(U t < x\F)\ < 2C(o r ± )<5(ylog(<5) + 1) + C p (K)r p/2 + S~^ p+q)+pq/2 . 

Given e > 0, we first select r > 0 such that C p (JC)r p / 2 < e/2. This defines C(y/r). We may then 
select 5 > 0 such that 2C'(o/ = )()((jflog(5) + 1) + §~( p+q ) + P q / 2 < e/ 2 , hence the claim follows. □ 

We are now ready to proceed to the proof of Theorem 5.5. Let x > 0. Due to Lemma E.3 and E.4, 
there exist 5 n , Li —> 0 such that 

P(U f < x|A) < P(v£ < x + 1^ - U n f || F) < P^ < x + 5 n \F) + o P (l) 

— — x \-^) + e n + Op(l) < P(Ut < x\ T 7 ) + 2e n + Op(l). 

By Lemma E.4, q a (V^\F) is continuous for a > 0. Hence we conclude from the above that for any 

a > 0 


q a (V^\F)-q a (V^\F)\=0 P (l). 


( 133 ) 
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Together with Theorem 5.4 and (133) this yields 

P(K f < q a (V^)) ->• P(W < q a (V'\r)) = a, (134) 

which completes the proof. 


Proof of Theorem 5.7. Denote by (^)i<»<n a sequence of i.i.d. standard normally distributed 
random variables. Let (L,;) i<i< n be an i.i.d. sequence of random variables with bounded support and 
a symmetric distribution and such that E[J7j] = 0, E[£/ 2 ] = 1. We then consider the following special 
model for the volatility a s . 

a s = |l + 0„Z7i|, s € [(* - l)/n, i/n) , (135) 

and the associated observed process (A"X = fl i-l)/n asdWs = I 1 + The 

sequence of parameters # n is introduced here to determine the roughness by fluctuations of a s . We 
shall later see how it is directly related to d n in (41). 

Theorem E.5. Suppose we make observations (A”X)i<j< n in model (135). For parameter sequences 
•&ni t?n, 0 i t?n,i £ [0,1), consider for 0 < 6 < 1 t/re null hypothesis Hq and alternative Hi 

FL a : d n = d n: o, for 1 < i < n, 

Hi: 'd n = d n: i f t?n,o> for L#nJ + 1 < i < n. 

Then for b n = o(n -1 / 2 ) and \$n 0 ~ d ni\ — b n we have 

inf 7^ (a, b n ) —> 1 as n —>■ oo. 

i> 

with the notion of the global testing error in Equation (6). 

Proof of Theorem E.5. First, we may simplify the experiment slightly as long as we increase the pro¬ 
vided information. Let us assume that it is known a priori that the time of change is 6 = 1/2, basically 
to simplify the notation. Also, consider the special shifted case where we set i? n) o = 0, and hence only 
the change d nt i is of interest. The general case readily follows by a simple translation argument. Since 
fin ,o = 0, the sample (A”A') 1< j < ^ n / 2 j does not carry any information (ancillary statistic) about the 
parameter d n y, and it suffices to consider observations (A™X) i n /2j+i<j<n by sufficiency. We shift 
indices such that we consider in the sequel the two statistical experiments 

£o : Observe 

£\ - Observe {^}i<j<Ln/ 2 j with Yj = |1 + 'd n yU j \Z j . 

We denote the associated (overall) probability measures with Qo resp. Oi. Following the steps as in the 
proof of Theorem 4.1 we derive an analogous criterion to Equation (93). Using Pinsker’s inequality, 
we see that it suffices to establish 



= l-o(l). (136) 

With a Lebesgue density fu of U\ , we can express the density fy of Y\ as 

fr(y)= I fv\Ui=u 1 (y)fu(ui)du 1 = I 1 - i ex p( on , <f - y>)fu(ui) dm . 

J J V27T |1 + d n ,lUl\ V2(l + VnfUiyJ 
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Considering d n ,\ —t 0, we can drop the absolute value above. In the following we have to prove that 
above specified choices of b n and d n .i imply convergence of the Kullback-Leibler divergence to zero. 


log 





[n/2j 2 

exp ^ — log(l + & nj lUj) - y ((1 + t?n,l UjY 2 - 1 

3 = 1 


fu(ui)dui... fu{u\ n /2\)du\ n / 2 \ J fy(yi)dyi ■ ■. /y(y|n/2j)<*/|n/2j 

= J- ■ J log ( J-J n«p (-' dn ’ lU 3 + + \ ( 2 ^n,i«j - 30*,i«?) 

\ 17 i=i 

+ C>(C,lC + ^n,l U i)) fu{u\)du\ . . . /c/(n Ln/ 2 j)dML n /2j^ fy{yi)dyi ■ ■ • /y(y[n/ 2 j)dy|n/ 2 j 

= /•"/'log ([■■[ n exp ((*$ - + ^y^-(l - 3 y 2 ) + + <y«j)) 

V 17 j=i 

fu(ui)dui ... /r/(n Ln/2 j)d«L ri / 2 j fy{y\)dyi • • • /y(y|n/ 2 j)dy|n/ 2 j 


= J-J log (/•••/ II ( x + ' d n,iuj(yj - 1 ) + ( 2 + yj - 5 y?) 

V 17 i=i 

+ + K,l u j)) fu(ui)dU! . . . fu(.Uln/ 2 \)du ln/2 jj f Y (yi)dyi ■ ■ ■ fY{y\n/2\)dy\n/2\ 

/. 1 . L n /2J / q %2 

= /■•■/ ( y ( 2 + yj-5y*) + 1 ))/n(yi) rf yi• • ■ /v- n (y^j)rfy|n/2j 

J J 3 =1 

= 0(n<i). 


The third order term vanishes by symmetry. Hence, if 1 = o(n 1 ), then (136) holds, which com¬ 
pletes the proof. □ 

Theorem 5.7 is deduced as a corollary of Theorem E.5. Providing the experimenter additional in¬ 
formation can only decrease the lower boundary on minimax distinguishability. We assume that it is 
known a priori that the time of change 0 = 1/2, which puts us into the framework of Theorem E.5. 
Set = n~ a and 'd n ,\ = b}J 2 n~ a ' with a > a 7 . Since a 7 < a reflects a larger fluctuation in our 
setup, we are only interested in the case where 'd n ^ > i) n p in our testing problem. If > ’d n .o, 
then lC,o - Cil = C,i - <o = o(n~ 1/2 ) leads to 


b n n 2a ' -n 2a = o(n 1 ^ 2 ) , 


(137) 


which in turns yields the condition 

b n = e>(n^ 1 / 2+2a, + 7?. 2 ( a a/ )) . 


( 138 ) 









Bibinger, Jirak, Vetter/Nonparametric change-point analysis of volatility 


53 


The claim then follows from Theorem E.5 for b n = n 2( n “ ) V n V 2 + 2a '_ 


Proof of Proposition 5.9. We only show the claim for 'ft, as optimality for a test based on Propo¬ 
sition 5.6 follows in the same manner. We first show that 



(139) 


under the alternative of Testing problem 5.2 for any b' n satisfying (55). Proposition E.2, Lemma E. 1 
and the triangle inequality give 


nO 


> rt -1 / 2 (i — 0) ^E[(A” q) 2 ] — 9 Y. E [( A ^)' 

i= 1 i=n6 +1 


-0 P (i). 


(140) 


For large enough n, this is further bounded from below by 

V* > (1 - 9)9(b' n n 1/2 ~ 2a ' - n l / 2 ~ 2a ) - O p ( 1). (141) 

For 9 6 (0,1), it is now easy to see that (55) implies 

(1 - 9)9[b' n n l/2 - 2a ' - n 1 / 2 " 20 ) -> oo, as n -> cx), (142) 

hence (139) follows. Next, we establish optimality of the test if a. By Theorem 5.5 it is an asymptotic 
level a-test. It is left to show consistency when (55) is valid. From (133) and V 1 = O p ( 1), we 
conclude that 


qi-a(V^\T) = 0p(l) = Op((l - 9)9{b' n n l/2 - 2a ' - n l ' 2 ~ 2a )) (143) 

for any fixed, finite 0 < a < 1. Thereby, ¥(Vn > <J\ -a(Vn Wj) — > 1, which completes the proof. 
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